BACKGROUND OF THE INVENTION

The presence of humoral immunity (circulating antibodies) to adenovirus 
capsid proteins is a barrier to the use of adenovirus vectors for gene therapy. The 
prototype adenovirus vectors that have been developed for gene therapy are based on 
subgroup C adenoviruses such as that of serotype 5. The prevalence of neutralizing
antibodies against subgroup C adenoviruses is generally high in human populations as
a result of frequent exposure to these pathogens. This fact is likely to greatly limit the
effectiveness of gene therapy vectors based on serotypes such as Ad5.

Analysis of the nature of the protective antibodies against adenoviruses has 
indicated that the most important target is the major capsid protein, hexon [Wolfhart
(1988) J. Virol. 62, 2321; Gall et al. (1996) J. Virol. 70, 2116]. Several efforts have
been made to engineer the hexon so as to evade the anti-hexon antibodies by making
chimeric adenoviruses harboring hexons from other serotypes [Roy et al. (1998) J.
Virol. 72, 6875; US Patent No. 5,922,315; Gall et al. (1998) J. Virol. 72, 10260; Youil
et al. (2002) Hum. Gene Ther. 13, 311; Wu et al. (2002) J. Virol. 76, 12775].
However, this has been largely unsuccessful when exchanges among distant serotypes
are attempted.

Alternatively, investigators have proposed using adenovirus vectors that 
rarely cause human infections or using adenoviruses from non-human sources. 
However, the lack of a practical manner in which to produce large numbers of such
vectors has proved to be a hindrance to developing such vectors.

SUMMARY OF THE INVENTION 

The present invention provides a method of modifying adenoviruses having capsids,
and particularly, including hexons, from serotypes which are not well adapted for
growth in cells useful for adenoviral virion production. The method is useful for
production of scalable amounts of adenoviruses. The modified, or chimeric,
adenoviruses are useful for a variety of purposes which are described herein.
The invention further provides novel, isolated, adenovirus SA18 nucleic acid and
amino acid sequences, vectors containing same, cell lines containing such SA18
sequences and/or vectors, and uses thereof.

Other aspects and advantages of the present invention will be readily apparent from 
the following detailed description of the invention. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Fig. 1 provides the map of the genome of the simian adenovirus generated
by shotgun cloning as described in the examples below.

Fig. 2 provides the map of the recombinant Adhu5-SV25 chimeric virus.
The portion of the genome replaced by DNA from Pan7 is indicated.

DETAILED DESCRIPTION OF THE INVENTION

The present invention provides chimeric adenoviruses composed of the left 
terminal end and right terminal end of an adenovirus which can be cultured in the 
selected host cell, and the internal regions encoding, at a minimum, the capsid 
proteins of another adenovirus serotype. This invention is particularly advantageous
for generating adenoviruses having serotypes which are difficult to culture in a
desired cell type. The invention thus permits generation of chimeric adenoviruses of
varying serotypes.

In the embodiments illustrated herein, chimeric adenoviruses have been
constructed where most structural proteins, and not merely the hexon or fiber, are
derived from an adenovirus of an unrelated serotype, thereby preserving the majority
of the protein-protein interactions that are involved in capsid assembly. Most of the
early genes, such as those encoded by the adenovirus E1 and E4 regions that are
responsible for transcription regulation and regulation of the host cell cycle, are
retained from a different serotype that is known to result in high titer virus generation
in the commonly used cell types, such as HEK 293, which supplies the Ad5 E1
proteins in trans.

In another embodiment, the invention provides novel nucleic acid and amino
acid sequences from Ad SA18, which was originally isolated from vervet monkey
[ATCC VR-943]. The present invention further provides novel adenovirus vectors
and packaging cell lines to produce those vectors for use in the in vitro production of
recombinant proteins or fragments or other reagents. The invention further provides
compositions for use in delivering a heterologous molecule for therapeutic or vaccine
purposes. Such therapeutic or vaccine compositions contain the adenoviral vectors
carrying an inserted heterologous molecule. In addition, novel sequences of the
invention are useful in providing the essential helper functions required for
production of recombinant adeno-associated viral (AAV) vectors. Thus, the invention
provides helper constructs, methods and cell lines which use these sequences in such
production methods.

The term "substantial homology" or "substantial similarity," when referring to
a nucleic acid or fragment thereof, indicates that, when optimally aligned with
appropriate nucleotide insertions or deletions with another nucleic acid (or its
complementary strand), there is nucleotide sequence identity in at least about 95 to
99% of the aligned sequences.

The term "substantial homology" or "substantial similarity," when referring to
amino acids or fragments thereof, indicates that, when optimally aligned with
appropriate amino acid insertions or deletions with another amino acid (or its
complementary strand), there is amino acid sequence identity in at least about 95 to
99% of the aligned sequences. Preferably, the homology is over full-length sequence,
or a protein thereof, or a fragment thereof which is at least 8 amino acids, or more
desirably, at least 15 amino acids in length. Examples of suitable fragments are
described herein.

The term "percent sequence identity" or "identical" in the context of nucleic
acid sequences refers to the residues in the two sequences that are the same when
aligned for maximum correspondence. Desirably, the length of sequence identity
comparison is over the full length of the genome (e.g., about 36 kbp), the full length
of an open reading frame of a gene, protein, subunit, or enzyme [see, e.g., the tables
providing the adenoviral coding regions], or a fragment of at least about 500 to 5000
nucleotides. However, identity among smaller fragments, e.g., of at least
about nine nucleotides, usually at least about 20 to 24 nucleotides, at least about 28 to
32 nucleotides, at least about 36 or more nucleotides, may also be desired. Similarly,
"percent sequence identity" may be readily determined for amino acid sequences
over the full length of a protein, or a fragment thereof. Suitably, a fragment is at least
about 8 amino acids in length, and may be up to about 700 amino acids. Examples of
suitable fragments are described herein.
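
As an illustration only, the percent identity calculation described above can be sketched in a few lines of Python. The sketch assumes the two sequences have already been aligned by a separate program and padded with "-" gap characters to equal length. The function names, the choice to skip gapped columns in the denominator, and the 95% default threshold are assumptions made for this example; alignment programs differ in how they count gaps, so values from this sketch may not match those reported by any particular tool.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over aligned columns in which neither sequence has a gap.

    Assumes the inputs are pre-aligned and of equal length (hypothetical helper,
    not part of any alignment package).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == gap or b == gap:
            continue  # gapped columns are skipped in this sketch
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


def is_substantially_similar(aligned_a: str, aligned_b: str,
                             threshold: float = 95.0) -> bool:
    """True when identity meets the threshold (95% is an illustrative default)."""
    return percent_identity(aligned_a, aligned_b) >= threshold


# Eight ungapped columns, seven of which match.
print(percent_identity("ACGT-ACGT", "ACGTTACGA"))
```

The same function can be applied to aligned amino acid sequences, since it compares characters column by column without regard to alphabet.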

Identity is readily determined using such algorithms and computer programs 
as are defined herein at default settings. Preferably, such identity is over the full
length of the protein, enzyme, subunit, or over a fragment of at least about 8 amino 
acids in length. However, identity may be based upon shorter regions, where suited to 
the use to which the identical gene product is being put. 

As described herein, alignments are performed using any of a variety of
publicly or commercially available Multiple Sequence Alignment Programs, such as
"Clustal W", accessible through Web Servers on the internet. Alternatively, Vector
NTI utilities are also used. There are also a number of algorithms known in the art
that can be used to measure nucleotide sequence identity, including those contained in
the programs described above. As another example, polynucleotide sequences can be
compared using Fasta, a program in GCG Version 6.1. Fasta provides alignments and
percent sequence identity of the regions of the best overlap between the query and
search sequences. For instance, percent sequence identity between nucleic acid
sequences can be determined using Fasta with its default parameters (a word size of 6
and the NOPAM factor for the scoring matrix) as provided in GCG Version 6.1,
herein incorporated by reference. Similar programs are available for performing
amino acid alignments. Generally, these programs are used at default settings,
although one of skill in the art can alter these settings as needed. Alternatively, one of
skill in the art can utilize another algorithm or computer program that provides at least
the level of identity or alignment as that provided by the referenced algorithms and
programs.

As used throughout this specification and the claims, the term "comprise" and
its variants, including "comprises" and "comprising", among other variants, are inclusive
of other components, elements, integers, steps and the like. The terms "consists of" or
"consisting of" are exclusive of other components, elements, integers, steps and the
like.

Except where otherwise specified, the term "vector" includes any genetic 
element known in the art which will deliver a target molecule to a cell, including, 
naked DNA, a plasmid, phage, transposon, cosmids, episomes, viruses, etc.

By "minigene" is meant the combination of a selected heterologous gene and
the other regulatory elements necessary to drive translation, transcription and/or
expression of the gene product in a host cell.

As used herein, the term "transcomplement" refers to when a gene (gene 
product) of one adenovirus serotype supplies an adenovirus serotype lacking this gene 
(gene product) from another serotype with the missing function. For example, human 
adenovirus serotype 5 E1a and E1b functions are known to transcomplement
E1-deleted chimpanzee adenovirus Pan9. Similarly, the inventors have found that
human Ad5 E1 transcomplements E1-deleted chimpanzee adenovirus serotypes Pan5,
Pan6, Pan7, and simian adenovirus serotypes SV1, SV25 and SV39. Other examples
of transcomplementing serotypes include human Ad5 and human Ad2, Ad3, Ad4,
Ad5, Ad7, and Ad12.

The term "functionally deleted" or "functional deletion" means that a
sufficient amount of the gene region is removed or otherwise damaged, e.g., by
mutation or modification, so that the gene region is no longer capable of producing 
functional products of gene expression. If desired, the entire gene region may be 
removed. Other suitable sites for gene disruption or deletion are discussed elsewhere 
in the application. 

The term "functional" refers to a product (e.g., a protein or peptide) which
performs its native function, although not necessarily at the same level as the native
product. The term "functional" may also refer to a gene which encodes and from
which a desired product can be expressed. 

I. Chimeric Adenoviral Vectors

The compositions of this invention include chimeric adenoviral vectors that 
deliver a heterologous molecule to cells. For delivery of such a heterologous 
molecule, the vector can be a plasmid or, preferably, a chimeric adenovirus. The
chimeric adenoviruses of the invention include adenovirus DNA from at least two
source serotypes, a "donating serotype" and a "parental adenovirus" as described in
more detail herein, and a minigene.

Because the adenoviral genome contains open reading frames on both strands, 
in many instances reference is made herein to 5' and 3' ends of the various regions to 
avoid confusion between specific open reading frames and gene regions. Thus, when
reference is made herein to the "left" and "right" end of the adenoviral genome, this
reference is to the ends of the approximately 36 kb adenoviral genome when depicted
in schematic form as is conventional in the art [see, e.g., Horwitz, "Adenoviridae and
Their Replication", in Virology, 2d ed., pp. 1679-1721 (1990)]. Thus, as used
herein, the "left terminal end" of the adenoviral genome refers to the portion of the
adenoviral genome which, when the genome is depicted schematically in linear form,
is located at the extreme left end of the schematic. Typically, the left end refers to the
portion of the genome beginning at map unit 0 and extending to the right to include at
least the 5' inverted terminal repeats (ITRs), and excludes the internal regions of the
genome encoding the structural genes. As used herein, the "right terminal end" of the
adenoviral genome refers to the portion of the adenoviral genome which, when the
genome is depicted schematically in linear form, is located at the extreme right end of
the schematic. Typically, the right end of the adenoviral genome refers to the portion
of the genome ending at map unit 36 and extending to the left to include at least the 3'
ITRs, and excludes the internal regions of the genome encoding the structural genes.
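
The left end / internal region / right end layout defined above can be pictured with a small data-structure sketch in Python. Everything in it is illustrative: the `Region` type, the `assemble_chimera` helper, and the lengths in the usage example are hypothetical placeholders rather than measured values, and the sketch adopts the simplifying assumption that both terminal ends come from a single donating serotype while the internal regions come from a different, parental serotype.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Region:
    serotype: str   # source serotype label, e.g. "Ad5" (illustrative)
    name: str       # "left_end", "internal", or "right_end"
    length_bp: int  # placeholder length in base pairs, not a measured value


def assemble_chimera(left: Region, internal: Region, right: Region) -> List[Region]:
    """Order the three regions left to right, as in a linear genome schematic."""
    if left.serotype != right.serotype:
        raise ValueError("both terminal ends are expected from the donating serotype")
    if internal.serotype == left.serotype:
        raise ValueError("internal regions are expected from a different, parental serotype")
    return [left, internal, right]


def total_length(regions: List[Region]) -> int:
    """Sum of region lengths in base pairs."""
    return sum(r.length_bp for r in regions)


# Hypothetical lengths chosen only to make the arithmetic easy to follow.
chimera = assemble_chimera(
    Region("Ad5", "left_end", 500),
    Region("Ad40", "internal", 30_000),
    Region("Ad5", "right_end", 3_000),
)
print([r.serotype for r in chimera], total_length(chimera))
```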
A. Adenovirus Regulatory Sequences
1. Serotype 

The selection of the adenovirus serotype donating its left
terminal end and right terminal end can be readily made by one of skill in the art from
among serotypes which can readily be cultured in the desired cell line. Among other
factors which may be considered in selecting the donating serotype is
compatibility with the adenovirus serotype which will be supplying the internal
regions at the location at which their sequences are hybridized.

Suitable adenoviruses for donating their left and right termini
are available from the American Type Culture Collection, Manassas, Virginia, US
(ATCC), a variety of academic and commercial sources, or the desired regions of the
donating adenoviruses may be synthesized using known techniques with reference to
sequences published in the literature or available from databases (e.g., GenBank,
etc.). Examples of suitable donating adenoviruses include, without limitation, human
adenovirus serotypes 2, 3, 4, 5, 7, and 12, and further including any of the presently
identified human types [see, e.g., Horwitz, "Adenoviridae and Their Replication", in
Virology, 2d ed., pp. 1679-1721 (1990)] which can be cultured in the desired cell.
Similarly, adenoviruses known to infect non-human primates (e.g., chimpanzees,
rhesus, macaque, and other simian species) or other non-human mammals and which
grow in the desired cell can be employed in the vector constructs of this invention.
Such serotypes include, without limitation, chimpanzee adenoviruses Pan5 [VR-591],
Pan6 [VR-592], Pan7 [VR-593], and C68 (Pan9), described in US Patent No.
6,083,716; and simian adenoviruses including, without limitation, SV1 [VR-195];
SV25 [SV-201]; SV35; SV15; SV-34; SV-36; SV-37, and baboon adenovirus
[VR-275], among others. In the following examples, the human 293 cells and adenovirus
type 5 (Ad5), Pan9, and Ad40 are used for convenience. However, one of skill in the
art will understand that other cell lines and/or comparable regions derived from other
adenoviral strains may be readily selected and used in the present invention in the
place of (or in combination with) these serotypes.
2. Sequences 

The minimum sequences which must be supplied by the
adenovirus donating its left terminal end and its right terminal end include the 5'
cis-elements and the 3' cis-elements necessary for replication and packaging. Typically,
the 5' cis-elements necessary for packaging and replication include the 5' inverted
terminal repeat (ITR) sequences (which function as origins of replication) and the
native 5' packaging enhancer domains (that contain sequences necessary for
packaging linear Ad genomes and enhancer elements for the E1 promoter). The right
end of the adenoviral genome includes the 3' cis-elements (including the ITRs)
necessary for packaging and encapsidation. Desirably, the adenovirus serotype
donating its left and right termini and/or an adenovirus serotype which
transcomplements the serotype of the donating adenovirus, further provides the
functions of the necessary adenovirus early genes, including E1 (E1a and E1b), E2
(E2a and E2b), and E4 (including at least the ORF6 region). E3 is not essential and
may be deleted as desired, e.g., for insertion of a transgene in this region or to provide
space for a transgene inserted in another region (typically for packaging it is desirable
for the total adenoviral genome to be under 36 kb).
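
The size consideration mentioned above reduces to simple arithmetic, sketched below in Python. The 36 kb ceiling is taken from the text; the helper names and all lengths in the usage example are hypothetical and are not measurements of any particular adenovirus, deletion, or transgene.

```python
PACKAGING_LIMIT_BP = 36_000  # "under 36 kb", as stated in the text above


def genome_size_after_edit(genome_bp: int, deleted_bp: int, insert_bp: int) -> int:
    """Genome length after removing a region and inserting a transgene cassette."""
    if min(genome_bp, deleted_bp, insert_bp) < 0:
        raise ValueError("lengths must be non-negative")
    if deleted_bp > genome_bp:
        raise ValueError("cannot delete more than the genome length")
    return genome_bp - deleted_bp + insert_bp


def within_packaging_limit(size_bp: int, limit_bp: int = PACKAGING_LIMIT_BP) -> bool:
    """True when the genome is under the desired total size."""
    return size_bp < limit_bp


# Hypothetical example: a 1.5 kb insert alone exceeds the limit,
# while also deleting a 2.5 kb region brings the genome back under it.
no_deletion = genome_size_after_edit(35_900, 0, 1_500)
with_deletion = genome_size_after_edit(35_900, 2_500, 1_500)
print(no_deletion, within_packaging_limit(no_deletion))
print(with_deletion, within_packaging_limit(with_deletion))
```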

In certain embodiments, the necessary adenovirus early genes are 
contained in the chimeric construct of the invention. In other embodiments, one or
more of the necessary adenovirus early genes can be provided by the packaging host 
cell or in trans. 

In general, the chimeric adenovirus of the invention contains
regulatory sequences from the donating adenovirus serotype, or a transcomplementing
serotype, to provide the chimeric adenovirus with compatible regulatory proteins.
Optionally, one or more of the necessary adenoviral structural genes is provided by 
the adenovirus donating its left terminal and its right terminal end. 

In certain embodiments, the chimeric adenovirus further 
contains one or more functional adenovirus genes, including, the Endoprotease open 
reading frame, DNA binding protein, 100 kDa scaffolding protein, 33 kDa protein, 
protein VIII, pTP, 52/55 kDa protein, protein VII, Mu and/or protein VI from the
adenovirus serotype donating its left and right termini. Where all of these genes are
derived from the adenovirus serotype donating the 5' and 3' ITRs, a "pseudotyped"
chimeric is formed. Optionally, one or more of the genes can be hybrids formed from
the fusion of the donating adenovirus serotype and the parental adenovirus serotype
providing the capsid proteins (e.g., without limitation, polymerase, terminal protein,
IIIa protein). Suitably, these genes express functional proteins which permit
packaging of the adenovirus genes into the capsid. Alternatively, one or more of
these proteins (whether hybrid or non-hybrid) can be functionally deleted in the
chimeric adenovirus. Where desired, any necessary proteins functionally deleted in
the chimeric adenovirus can be expressed in trans in the packaging cell.
B. Parental Adenovirus Structural Proteins 
1. Serotypes 

This invention is particularly well adapted for use in 
generating chimeric adenoviruses in which the capsid proteins are from a parental 
adenovirus which does not efficiently grow in a desirable host cell. The selection of 
the parental adenovirus serotype providing the internal regions can be readily made 
by one of skill in the art based on the information provided herein. 

A variety of suitable adenoviruses can serve as a parental 
adenovirus supplying the regions encoding the structural (i.e., capsid) proteins. Many
of such adenoviruses can be obtained from the same sources as described above for
the donating adenovirus serotypes. Examples of suitable parental adenovirus
serotypes include, without limitation, human adenovirus serotype 40, among others
[see, e.g., Horwitz, "Adenoviridae and Their Replication", in Virology, 2d ed., pp.
1679-1721 (1990)], and adenoviruses known to infect non-human primates (e.g.,
chimpanzees, rhesus, macaque, and other simian species) or other non-human
mammals, including, without limitation, chimpanzee adenovirus C1, described in US
Patent No. 6,083,716; simian adenoviruses, and baboon adenoviruses, among others.
In addition, the parental adenovirus supplying the internal regions may be from a
non-naturally occurring adenovirus serotype, such as may be generated using a variety of
techniques known to those of skill in the art.

In one embodiment illustrated herein, a chimeric virus constructed
between the chimpanzee adenoviruses Pan5 and C1 exhibited a higher titer
in human 293 cells than the wild-type parental virus. However, the invention is not
limited to the use of these chimpanzee adenoviruses, or to the combination of
simian-simian, human-human, or simian-human chimeric adenoviruses. For example, it may
be desirable to utilize bovine or canine adenoviruses, or other non-human mammalian
adenoviruses which do not naturally infect and/or replicate in human cells.

In the following examples, the human adenovirus type 40
(Ad40) and the chimpanzee adenovirus C1, simian Pan5 and Ad40, and Pan5 and
simian adenovirus SA18, are used. However, one of skill in the art will understand
that other adenoviral serotypes may be readily selected and used in the present
invention in the place of (or in combination with) these serotypes.
2. Sequences 

The parental adenovirus provides to the chimeric construct of
the invention its internal regions, which include structural proteins necessary for
generating a capsid having the desired characteristics of the parental adenovirus.
These desired characteristics include, but are not limited to, the ability to infect target
cells and deliver a heterologous transgene, the ability to elude neutralizing
antibodies directed to another adenovirus serotype (i.e., avoiding clearance due to
cross-reactivity), and/or the ability to infect cells in the absence of an immune
response to the chimeric adenovirus. The advantages of such characteristics may be
most readily apparent in a regimen which involves repeat delivery of adenoviral
vectors. The left and right termini of the parental adenovirus, including at least the 5'
ITRs, the E1 region, the E4 region and the 3' ITRs, are non-functional and, preferably,
completely absent. Optionally, all adenovirus regulatory proteins from this parental
adenovirus are non-functional and only the structural proteins (or selected structural
proteins) are retained.

At a minimum, the parental adenovirus provides the
adenoviral late region encoding the hexon protein. Suitably, the parental adenovirus
further provides the late regions encoding the penton and the fiber. In certain
embodiments, all of the functional adenoviral late regions, including L1 (encoding
52/55 kDa, IIIa proteins), L2 (encoding penton, VII, V, Mu proteins), L3 (encoding VI,
hexon, Endoprotease), L4 (encoding 100 kD, 33 kD, VIII proteins) and L5 (encoding
fiber protein) are supplied by the parental adenovirus. Optionally, one or more of
these late gene functions, with the exception of those encoding the hexon, penton and
fiber proteins, can be functionally deleted. Any necessary structural proteins may be
supplied in trans.
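
For readers who prefer to see the late-region assignments as a lookup table, the following Python sketch restates the listing above. The dictionary contents are taken from the text; the helper names and the decision to treat hexon, penton and fiber as the only proteins that may not be functionally deleted are an illustrative reading of the paragraph, not a statement of any required implementation.

```python
# Late regions and the proteins each encodes, as listed in the text above.
LATE_REGIONS = {
    "L1": ["52/55 kDa", "IIIa"],
    "L2": ["penton", "VII", "V", "Mu"],
    "L3": ["VI", "hexon", "endoprotease"],
    "L4": ["100 kDa", "33 kDa", "VIII"],
    "L5": ["fiber"],
}

# Capsid proteins the text excludes from optional functional deletion.
REQUIRED_FROM_PARENT = {"hexon", "penton", "fiber"}


def region_of(protein: str) -> str:
    """Return the late region that encodes the named protein."""
    for region, proteins in LATE_REGIONS.items():
        if protein in proteins:
            return region
    raise KeyError(protein)


def may_be_functionally_deleted(protein: str) -> bool:
    """True for late proteins other than hexon, penton and fiber."""
    region_of(protein)  # raises KeyError for names not in the table
    return protein not in REQUIRED_FROM_PARENT


print(region_of("hexon"), may_be_functionally_deleted("hexon"))
print(region_of("IIIa"), may_be_functionally_deleted("IIIa"))
```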

Thus, in certain embodiments, the chimeric adenovirus further 
contains one or more functional adenovirus genes, including, the Endoprotease open 
reading frame, DNA binding protein, 100 kDa scaffolding protein, 33 kDa protein, 
protein VIII, pTP, 52/55 kDa protein, protein VII, Mu and/or protein VI from the 
parental adenovirus donating its internal regions. Optionally, one or more of the 
genes can be hybrids formed from the fusion of the donating adenovirus serotype and 
the parental adenovirus serotype providing the capsid proteins, as described above. 
C. The "Minigene"

Typically, an adenoviral vector of the invention is designed to contain 
a minigene which may be inserted into the site of a partially deleted, fully deleted 
(absent), or disrupted adenoviral gene. For example, the minigene may be located in 
the site of such a functional E1 deletion or functional E3 deletion, or another suitable
site. 

The methods employed for the selection of the transgene, the cloning 
and construction of the "minigene" and its insertion into the viral vector are within the 
skill in the art given the teachings provided herein.

1. The transgene 

The transgene is a nucleic acid sequence, heterologous to the 
vector sequences flanking the transgene, which encodes a polypeptide, protein, or 
other product, of interest. The nucleic acid coding sequence is operatively linked to 
regulatory components in a manner which permits transgene transcription, translation, 
and/or expression in a host cell.

The composition of the transgene sequence will depend upon
the use to which the adenoviral vector will be put. For example, the adenoviral vector
may be used as a helper virus in production of recombinant adeno-associated viruses
or in production of recombinant adenoviruses deleted of essential adenoviral gene
functions which are supplied by the adenoviral vector, or for a variety of production
uses. Alternatively, the adenoviral vector may be used for diagnostic purposes.

One type of transgene sequence includes a reporter sequence, 
which upon expression produces a detectable signal. Such reporter sequences 
include, without limitation, DNA sequences encoding β-lactamase, β-galactosidase
(LacZ), alkaline phosphatase, thymidine kinase, green fluorescent protein (GFP),
chloramphenicol acetyltransferase (CAT), luciferase, membrane bound proteins
including, for example, CD2, CD4, CD8, the influenza hemagglutinin protein, and
others well known in the art, to which high affinity antibodies directed thereto exist or
can be produced by conventional means, and fusion proteins comprising a membrane
bound protein appropriately fused to an antigen tag domain from, among others,
hemagglutinin or Myc. These coding sequences, when associated with regulatory
elements which drive their expression, provide signals detectable by conventional
means, including enzymatic, radiographic, colorimetric, fluorescence or other
spectrographic assays, fluorescent activating cell sorting assays and immunological
assays, including enzyme linked immunosorbent assay (ELISA), radioimmunoassay
(RIA) and immunohistochemistry. For example, where the marker sequence is the
LacZ gene, the presence of the vector carrying the signal is detected by assays for
beta-galactosidase activity. Where the transgene is GFP or luciferase, the vector
carrying the signal may be measured visually by color or light production in a
luminometer.

However, desirably, the transgene is a non-marker sequence 
encoding a product which is useful in biology and medicine, such as proteins, 
peptides, RNA, enzymes, or catalytic RNAs. Desirable RNA molecules include
tRNA, dsRNA, ribosomal RNA, siRNAs, small hairpin RNAs, trans-splicing RNAs,
catalytic RNAs, and antisense RNAs. One example of a useful RNA sequence is a 
sequence which extinguishes expression of a targeted nucleic acid sequence in the 
treated animal.

The transgene may be used for treatment, e.g., of genetic
deficiencies, as a cancer therapeutic or vaccine, for induction of an immune response,
and/or for prophylactic vaccine purposes. As used herein, induction of an immune
response refers to the ability of a molecule (e.g., a gene product) to induce a T cell
and/or a humoral immune response to the molecule. The invention further includes
using multiple transgenes, e.g., to correct or ameliorate a condition caused by a
multi-subunit protein. In certain situations, a different transgene may be used to encode
each subunit of a protein, or to encode different peptides or proteins. This is desirable
when the size of the DNA encoding the protein subunit is large, e.g., for an
immunoglobulin, the platelet-derived growth factor, or a dystrophin protein. In order
for the cell to produce the multi-subunit protein, a cell is infected with the
recombinant virus containing each of the different subunits. Alternatively, different
subunits of a protein may be encoded by the same transgene. In this case, a single
transgene includes the DNA encoding each of the subunits, with the DNA for each
subunit separated by an internal ribosome entry site (IRES). This is desirable when
the size of the DNA encoding each of the subunits is small, e.g., the total size of the
DNA encoding the subunits and the IRES is less than five kilobases. As an alternative
to an IRES, the DNA may be separated by sequences encoding a 2A peptide, which
self-cleaves in a post-translational event. See, e.g., M.L. Donnelly, et al., J. Gen.
Virol., 78(Pt 1):13-21 (Jan 1997); Furler, S., et al., Gene Ther., 8(11):864-873 (June
2001); Klump H., et al., Gene Ther., 8(10):811-817 (May 2001). This 2A peptide is
significantly smaller than an IRES, making it well suited for use when space is a
limiting factor. However, the selected transgene may encode any biologically active
product or other product, e.g., a product desirable for study.
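
The size reasoning in the preceding paragraph (one transgene per subunit when the coding DNA is large, a single transgene with an IRES or 2A linker when the total is small) can be sketched as a short Python helper. The five kilobase ceiling comes from the text, where it is stated for the IRES case; applying the same ceiling to a 2A linker is an assumption of this sketch. The linker lengths in the usage example (about 600 bp for an IRES, about 60 bp for a 2A coding sequence) are rough illustrative figures, and the function name is hypothetical.

```python
from typing import Sequence

SINGLE_TRANSGENE_LIMIT_BP = 5_000  # "less than five kilobases", per the text


def plan_multisubunit_expression(subunit_bp: Sequence[int], linker_bp: int,
                                 limit_bp: int = SINGLE_TRANSGENE_LIMIT_BP) -> str:
    """Suggest one transgene or separate transgenes from total coding size.

    One linker is assumed between each pair of adjacent subunits.
    """
    if not subunit_bp:
        raise ValueError("at least one subunit is required")
    n_linkers = len(subunit_bp) - 1
    total_bp = sum(subunit_bp) + n_linkers * linker_bp
    if total_bp < limit_bp:
        return "single transgene"
    return "separate transgenes"


# Two hypothetical subunits of 2.2 kb and 2.4 kb:
print(plan_multisubunit_expression([2_200, 2_400], linker_bp=600))  # IRES-sized linker
print(plan_multisubunit_expression([2_200, 2_400], linker_bp=60))   # 2A-sized linker
```

With these placeholder numbers the shorter linker is what brings the combined cassette under the ceiling, which mirrors the remark above that a 2A peptide is well suited for use when space is limiting.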

Suitable transgenes may be readily selected by one of skill in 
the art. The selection of the transgene is not considered to be a limitation of this 
invention. 

2. Vector and Transgene Regulatory Elements 

In addition to the major elements identified above for the 
minigene, the adenoviral vector also includes conventional control elements which 
are operably linked to the transgene in a manner that permits its transcription, 
translation and/or expression in a cell transfected with the plasmid vector or infected 
with the virus produced by the invention. As used herein, "operably linked"
sequences include both expression control sequences that are contiguous with the
gene of interest and expression control sequences that act in trans or at a distance to
control the gene of interest.

Expression control sequences include appropriate transcription
initiation, termination, promoter and enhancer sequences; efficient RNA processing
signals such as splicing and polyadenylation (polyA) signals; sequences that stabilize
cytoplasmic mRNA; sequences that enhance translation efficiency (i.e., Kozak
consensus sequence); sequences that enhance protein stability; and when desired,
sequences that enhance secretion of the encoded product. A great number of
expression control sequences, including promoters which are native, constitutive,
inducible and/or tissue-specific, are known in the art and may be utilized.

Examples of constitutive promoters include, without
limitation, the retroviral Rous sarcoma virus (RSV) LTR promoter (optionally with
the RSV enhancer), the cytomegalovirus (CMV) promoter (optionally with the CMV
enhancer) [see, e.g., Boshart et al., Cell, 41:521-530 (1985)], the SV40 promoter, the
dihydrofolate reductase promoter, the β-actin promoter, the phosphoglycerol kinase
(PGK) promoter, and the EF1α promoter [Invitrogen].

Inducible promoters allow regulation of gene expression and
can be regulated by exogenously supplied compounds, environmental factors such as
temperature, or the presence of a specific physiological state, e.g., acute phase, a
particular differentiation state of the cell, or in replicating cells only. Inducible
promoters and inducible systems are available from a variety of commercial sources,
including, without limitation, Invitrogen, Clontech and Ariad. Many other systems
have been described and can be readily selected by one of skill in the art. For
example, inducible promoters include the zinc-inducible sheep metallothionine (MT)
promoter and the dexamethasone (Dex)-inducible mouse mammary tumor virus
(MMTV) promoter. Other inducible systems include the T7 polymerase promoter
system [WO 98/10088]; the ecdysone insect promoter [No et al., Proc. Natl. Acad.
Sci. USA, 93:3346-3351 (1996)], the tetracycline-repressible system [Gossen et al.,
Proc. Natl. Acad. Sci. USA, 89:5547-5551 (1992)], the tetracycline-inducible system
[Gossen et al., Science, 268:1766-1769 (1995), see also Harvey et al., Curr. Opin.
Chem. Biol., 2:512-518 (1998)]. Other systems include the FK506 dimer, VP16 or
p65 using castradiol, diphenol murislerone, the RU486-inducible system [Wang et al.,
Nat. Biotech., 15:239-243 (1997) and Wang et al., Gene Ther., 4:432-441 (1997)] and
the rapamycin-inducible system [Magari et al., J. Clin. Invest., 100:2863-2872
(1997)]. The effectiveness of some inducible promoters increases over time. In such
cases one can enhance the effectiveness of such systems by inserting multiple
repressors in tandem, e.g., TetR linked to a TetR by an IRES. Alternatively, one can
wait at least 3 days before screening for the desired function. One can enhance
expression of desired proteins by known means to enhance the effectiveness of this
system, for example, by using the Woodchuck Hepatitis Virus Posttranscriptional
Regulatory Element (WPRE).

In another embodiment, the native promoter for the transgene will be used. The
native promoter may be preferred when it is desired that expression of the
transgene should mimic the native expression. The native promoter may be used
when expression of the transgene must be regulated temporally or
developmentally, or in a tissue-specific manner, or in response to specific
transcriptional stimuli. In a further embodiment, other native expression
control elements, such as enhancer elements, polyadenylation sites or Kozak
consensus sequences may also be used to mimic the native expression.

Another embodiment of the transgene includes a transgene operably linked to a
tissue-specific promoter. For instance, if expression in skeletal muscle is
desired, a promoter active in muscle should be used. These include the promoters
from genes encoding skeletal β-actin, myosin light chain 2A, dystrophin, muscle
creatine kinase, as well as synthetic muscle promoters with activities higher
than naturally occurring promoters (see Li et al., Nat. Biotech., 17:241-245
(1999)). Examples of promoters that are tissue-specific are known for liver
(albumin, Miyatake et al., J. Virol., 71:5124-32 (1997); hepatitis B virus core
promoter, Sandig et al., Gene Ther., 3:1002-9 (1996); alpha-fetoprotein (AFP),
Arbuthnot et al., Hum. Gene Ther., 7:1503-14 (1996)), bone osteocalcin (Stein et
al., Mol. Biol. Rep., 24:185-96 (1997)); bone sialoprotein (Chen et al., J. Bone
Miner. Res., 11:654-64 (1996)), lymphocytes (CD2, Hansal et al., J. Immunol.,
161:1063-8 (1998); immunoglobulin heavy chain; T cell receptor chain), neuronal
such as neuron-specific enolase (NSE) promoter (Andersen et al., Cell. Mol.
Neurobiol., 13:503-15 (1993)), neurofilament light-chain gene (Piccioli et al.,
Proc. Natl. Acad. Sci. USA, 88:5611-5 (1991)), and the neuron-specific vgf gene
(Piccioli et al., Neuron, 15:373-84 (1995)), among others.
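
The tissue-specific promoter examples listed above can be organized as a simple lookup table. The following is only an illustrative sketch: the dictionary name, the function name and the tissue keys are hypothetical conveniences, and the entries are limited to the examples named in the preceding paragraph.

```python
# Illustrative lookup of the tissue-specific promoter examples named in the
# text. The names TISSUE_PROMOTERS and promoters_for are hypothetical.
TISSUE_PROMOTERS = {
    "skeletal muscle": [
        "skeletal beta-actin",
        "myosin light chain 2A",
        "dystrophin",
        "muscle creatine kinase",
        "synthetic muscle promoter",
    ],
    "liver": [
        "albumin",
        "hepatitis B virus core promoter",
        "alpha-fetoprotein (AFP)",
    ],
    "bone": ["osteocalcin", "bone sialoprotein"],
    "lymphocytes": [
        "CD2",
        "immunoglobulin heavy chain",
        "T cell receptor chain",
    ],
    "neuronal": [
        "neuron-specific enolase (NSE)",
        "neurofilament light-chain gene",
        "neuron-specific vgf gene",
    ],
}


def promoters_for(tissue: str) -> list[str]:
    """Return the example promoters listed for a tissue, or an empty list."""
    return list(TISSUE_PROMOTERS.get(tissue.strip().lower(), []))


if __name__ == "__main__":
    for name in promoters_for("Liver"):
        print(name)
```

A tissue that is not among the examples simply returns an empty list; the table is not a complete catalogue of known promoters.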

Optionally, vectors carrying transgenes encoding therapeutically useful or
immunogenic products may also include selectable markers or reporter genes,
which may include sequences encoding geneticin, hygromycin or puromycin
resistance, among others. Such selectable reporters or marker genes (preferably
located outside the viral genome to be packaged into a viral particle) can be
used to signal the presence of the plasmids in bacterial cells, such as
ampicillin resistance. Other components of the vector may include an origin of
replication. Selection of these and other promoters and vector elements is
conventional and many such sequences are available [see, e.g., Sambrook et al.,
and references cited therein].

These vectors are generated using the techniques and sequences provided herein,
in conjunction with techniques known to those of skill in the art. Such
techniques include conventional cloning techniques of cDNA such as those
described in texts [Sambrook et al., Molecular Cloning: A Laboratory Manual,
Cold Spring Harbor Press, Cold Spring Harbor, NY], use of overlapping
oligonucleotide sequences of the adenovirus genomes, polymerase chain reaction,
and any suitable method which provides the desired nucleotide sequence.

II. Production of the Recombinant Viral Particle 

In one embodiment, the invention provides a method of generating recombinant
chimeric adenoviral particles in which the capsid of the chimeric adenovirus is
of a serotype incapable of efficient growth in the selected host cell. A vector
suitable for production of recombinant chimeric adenoviral particles can be
generated by direct cloning. Alternatively, such particles can be generated by
homologous recombination between a first vector containing the left end of the
chimeric adenoviral genome and a second vector containing the right end of the
chimeric adenoviral genome. However, any suitable methodology known to those of
skill in the art can be readily utilized to generate a production vector,
preferably one which contains the entire chimeric adenoviral genome, including a
minigene. This production vector is then introduced into a host cell in which
the adenoviral capsid protein is assembled and the chimeric adenoviral particle
assembled as described.

The chimeric adenoviruses of the invention include those in which one or 
more adenoviral genes are absent, or otherwise rendered non-functional. If any of the 
missing gene functions are essential to the replication and infectivity of the adenoviral 
particle, these functions are supplied by a complementation (or transcomplementing) 
cell line or a helper vector expressing these functions during production of the 
chimeric adenoviral particle. 

Examples of chimeric adenoviruses containing such missing adenoviral gene
functions include those which are partially or completely deleted in the E1a
and/or E1b gene. In such a case, the E1 gene functions can be supplied by the
packaging host cell, permitting the chimeric construct to be deleted of E1 gene
functions and, if desired, for a transgene to be inserted in this region.
Optionally, the E1 gene can be of a serotype which transcomplements the serotype
providing the other adenovirus sequences in order to further reduce the
possibility of recombination and improve safety. In other embodiments, it is
desirable to retain an intact E1a and/or E1b region in the recombinant
adenoviruses. Such an intact E1 region may be located in its native location in
the adenoviral genome or placed in the site of a deletion in the native
adenoviral genome (e.g., in the E3 region).

In another example, all or a portion of the adenovirus delayed early gene E3
may be eliminated from the chimeric adenovirus. The function of adenovirus E3 is
believed to be irrelevant to the function and production of the recombinant
virus particle. Chimeric adenovirus vectors may also be constructed having a
deletion of at least the ORF6 region of the E4 gene, and more desirably because
of the redundancy in the function of this region, the entire E4 region. Still
another vector of this invention contains a deletion in the delayed early gene
E2a. Similarly, deletions in the intermediate genes IX and IVa2 may be useful
for some purposes. Optionally, deletions may also be made in selected portions
of the late genes L1 through L5, as described above.

Other deletions may be made in the other structural or non-structural
adenovirus genes. The above-discussed deletions may be used individually, i.e.,
an adenovirus sequence for use in the present invention may contain deletions in
only a single region. Alternatively, deletions of entire genes or portions
thereof effective to destroy their biological activity may be used in any
combination. For example, in one exemplary vector, the adenovirus sequence may
have deletions of the E1 genes and the E4 gene, or of the E1, E2a and E3 genes,
or of the E1 and E3 genes, or of E1, E2a and E4 genes, with or without deletion
of E3, and so on. As discussed above, such deletions may be used in combination
with other mutations, such as temperature-sensitive mutations, to achieve a
desired result.

Examples of suitable transcomplementing serotypes are provided above. The use
of transcomplementing serotypes can be particularly advantageous where there is
diversity between the Ad sequences in the vector of the invention and the human
AdE1 sequences found in currently available packaging cells. In such cases, the
use of the current human E1-containing cells prevents the generation of
replication-competent adenoviruses during the replication and production
process. However, in certain circumstances, it will be desirable to utilize a
cell line which expresses the E1 gene products for production of an E1-deleted
simian adenovirus. Such cell lines have been described. See, e.g., US Patent
6,083,716.

A. Packaging Host Cells

Suitably, the packaging host cell is selected from among cells in which the
adenovirus serotype donating the left and right terminal ends of the chimeric
genome is capable of efficient growth. The host cells are preferably of
mammalian origin, and most preferably are of non-human primate or human origin.

Particularly desirable host cells are selected from among any mammalian
species, including, without limitation, cells such as A549 [ATCC Accession No.
CCL 185], 911 cells, WEHI, 3T3, 10T1/2, HEK 293 cells or PERC6 (both of which
express functional adenoviral E1) [Fallaux, FJ et al., (1998), Hum Gene Ther,
9:1909-1917], Saos, C2C12, L cells, HT1080, HepG2, HeLa [ATCC Accession No.
CCL 2], KB [CCL 17], Detroit [e.g., Detroit 510, CCL 72] and WI-38 [CCL 75]
cells, and primary fibroblast, hepatocyte and myoblast cells derived from
mammals including human, monkey, mouse, rat, rabbit, and hamster. These cell
lines are all available from the American Type Culture Collection, 10801
University Boulevard, Manassas, Virginia 20110-2209. Other suitable cell lines
may be obtained from other sources. The selection of the mammalian species
providing the cells is not a limitation of this invention; nor is the type of
mammalian cell, i.e., fibroblast, hepatocyte, tumor cell, etc.

As described above, a chimeric adenovirus of the invention can lack one or more
functional adenoviral regulatory and/or structural genes which are supplied
either by the host cell or in trans to effect packaging of the chimeric
adenovirus into the viral capsid to generate the viral particle. Thus, the
ability of a selected host cell to supply transcomplementing adenoviral
sequences may be taken into consideration in selecting a desired host cell.

In one example, the cells are from a stable cell line which expresses
adenovirus E1a and E1b functions from a cell line which transcomplements the
adenovirus serotype which donates the left and right termini to the chimera of
the invention, permitting the chimera to be E1-deleted. Alternatively, where the
cell line does not transcomplement the adenovirus donating the termini, E1
functions may be provided by the chimera, or in trans.

If desired, one may utilize the sequences provided herein to generate a
packaging cell or cell line that expresses, at a minimum, the adenovirus E1 gene
from the adenovirus serotype donating the 5' ITR under the transcriptional
control of a promoter for expression, or a transcomplementing serotype, in a
selected parent cell line. Inducible or constitutive promoters may be employed
for this purpose. Examples of such promoters are described in detail elsewhere
in this specification. A parent cell is selected for the generation of a novel
cell line expressing any desired adenovirus or adenovirus gene, including, e.g.,
a human Ad5, AdPan5, Pan6, Pan7, SV1, SV25 or SV39 gene. Without limitation,
such a parent cell line may be HeLa [ATCC Accession No. CCL 2], A549 [ATCC
Accession No. CCL 185], HEK 293, KB [CCL 17], Detroit [e.g., Detroit 510,
CCL 72] and WI-38 [CCL 75] cells, among others. Many of these cell lines are
available from the ATCC. Other suitable parent cell lines may be obtained from
other sources.

Such E1-expressing cell lines are useful in the generation of chimeric
adenovirus E1 deleted vectors. Additionally, or alternatively, the invention
provides cell lines that express one or more simian adenoviral gene products,
e.g., E1a, E1b, E2a, and/or E4 ORF6, which can be constructed using essentially
the same procedures for use in the generation of chimeric viral vectors. Such
cell lines can be utilized to transcomplement adenovirus vectors deleted in the
essential genes that encode those products, or to provide helper functions
necessary for packaging of a helper-dependent virus (e.g., adeno-associated
virus). The preparation of a host cell according to this invention involves
techniques such as assembly of selected DNA sequences. This assembly may be
accomplished utilizing conventional techniques. Such techniques include cDNA
and genomic cloning which are well known and are described in Sambrook et al.,
cited above, use of overlapping oligonucleotide sequences of the adenovirus
genomes, combined with polymerase chain reaction, synthetic methods, and any
other suitable methods which provide the desired nucleotide sequence.

In still another alternative, the essential adenoviral gene products are
provided in trans by the adenoviral vector and/or helper virus. In such an
instance, a suitable host cell can be selected from any biological organism,
including prokaryotic (e.g., bacterial) cells, and eukaryotic cells, including
insect cells, yeast cells and mammalian cells. Particularly desirable host cells
are selected from among any mammalian species, including, without limitation,
cells such as A549, WEHI, 3T3, 10T1/2, HEK 293 cells or PERC6 (both of which
express functional adenoviral E1) [Fallaux, FJ et al., (1998), Hum Gene Ther,
9:1909-1917], Saos, C2C12, L cells, HT1080, HepG2 and primary fibroblast,
hepatocyte and myoblast cells derived from mammals including human, monkey,
mouse, rat, rabbit, and hamster. The selection of the mammalian species
providing the cells is not a limitation of this invention; nor is the type of
mammalian cell, i.e., fibroblast, hepatocyte, tumor cell, etc.

B. Helper Vectors

Thus, depending upon the adenovirus gene content of the adenoviral vectors and
any adenoviral gene functions expressed from the host cell, a helper vector may
be necessary to provide sufficient adenovirus gene sequences necessary to
produce an infective recombinant viral particle containing the minigene. See,
for example, the techniques described for preparation of a "minimal" human Ad
vector in International Patent Application WO96/13597, published May 9, 1996,
and incorporated herein by reference. Suitably, these helper vectors may be
non-replicating genetic elements, a plasmid, or a virus.
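
The reasoning above (which functions the vector lacks, which the packaging cell supplies, and therefore what a helper vector must provide) amounts to a set difference. The sketch below is illustrative only. It assumes, for the purpose of the example, that E1a, E1b, E2a and E4 ORF6 are the essential functions of interest and that E3 is dispensable, as described earlier in this section; the function and variable names are hypothetical.

```python
# Assumption for illustration: these are the essential functions considered.
# E3 is treated as dispensable, so deleting it never requires complementation.
ESSENTIAL_FUNCTIONS = frozenset({"E1a", "E1b", "E2a", "E4 ORF6"})


def functions_needed_from_helper(deleted, host_supplied):
    """Return the essential functions that neither the vector nor the
    packaging host cell provides, i.e. what a helper vector must supply."""
    missing_from_vector = ESSENTIAL_FUNCTIONS & set(deleted)
    return set(missing_from_vector - set(host_supplied))


if __name__ == "__main__":
    # E1/E3-deleted vector in an E1-expressing cell line: no helper needed.
    print(functions_needed_from_helper({"E1a", "E1b", "E3"}, {"E1a", "E1b"}))
    # E1/E4-deleted vector in the same cell line: E4 ORF6 must come in trans.
    print(functions_needed_from_helper({"E1a", "E1b", "E4 ORF6"}, {"E1a", "E1b"}))
```

An empty result corresponds to the case in which the packaging cell line alone is sufficient and no helper vector is required.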

Useful helper vectors contain selected adenovirus gene sequences not present in
the adenovirus vector construct and/or not expressed by the packaging cell line
in which the vector is transfected. In one embodiment, the helper virus is
replication-defective and contains a variety of adenovirus genes in addition to
the sequences described above. Such a helper vector is desirably used in
combination with an E1-expressing cell line.

Helper vectors may be formed into poly-cation conjugates as described in Wu et
al., J. Biol. Chem., 264:16985-16987 (1989); K. J. Fisher and J. M. Wilson,
Biochem. J., 299:49 (April 1, 1994). A helper vector may optionally contain
a second reporter minigene. A number of such reporter genes are known to the art. 
The presence of a reporter gene on the helper virus which is different from the 
transgene on the adenovirus vector allows both the Ad vector and the helper vector to 
be independently monitored. This second reporter is used to enable separation 
between the resulting recombinant virus and the helper virus upon purification. 

C. Assembly of Viral Particle and Transfection of a Cell Line

Generally, when delivering the vector comprising the minigene by transfection,
the vector is delivered in an amount from about 5 µg to about 100 µg DNA, and
preferably about 10 to about 50 µg DNA to about 1 x 10^ cells to about 1 x
10'^ cells, and preferably about 10^ cells. However, the relative amounts of
vector DNA to host cells may be adjusted, taking into consideration such factors
as the selected vector, the delivery method and the host cells selected.
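
As a rough illustration of the DNA amounts stated above, the sketch below checks a planned amount of vector DNA against the stated range (about 5 µg to about 100 µg) and the preferred range (about 10 µg to about 50 µg). The function name and the returned labels are hypothetical, and because the text says "about", the hard cut-offs used here are a simplifying assumption.

```python
# Ranges taken from the paragraph above, in micrograms of vector DNA.
# Treating "about" as a hard boundary is a simplification for illustration.
STATED_RANGE_UG = (5.0, 100.0)
PREFERRED_RANGE_UG = (10.0, 50.0)


def classify_dna_amount(micrograms: float) -> str:
    """Label a planned DNA amount as 'preferred', 'within range' or 'outside range'."""
    if micrograms < 0:
        raise ValueError("DNA amount cannot be negative")
    if PREFERRED_RANGE_UG[0] <= micrograms <= PREFERRED_RANGE_UG[1]:
        return "preferred"
    if STATED_RANGE_UG[0] <= micrograms <= STATED_RANGE_UG[1]:
        return "within range"
    return "outside range"


if __name__ == "__main__":
    for amount in (2.0, 7.5, 25.0, 80.0, 150.0):
        print(amount, classify_dna_amount(amount))
```

In practice the amount would still be adjusted for the selected vector, delivery method and host cells, as noted above.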

Introduction into the host cell of the vector may be achieved by any
means known in the art or as disclosed above, including transfection, and infection.
One or more of the adenoviral genes may be stably integrated into the genome of the 
host cell, stably expressed as episomes, or expressed transiently. The gene products 
may all be expressed transiently, on an episome or stably integrated, or some of the 
gene products may be expressed stably while others are expressed transiently. 
Furthermore, the promoters for each of the adenoviral genes may be
selected independently from a constitutive promoter, an inducible promoter or a
native adenoviral promoter. The promoters may be regulated by a specific 
physiological state of the organism or cell (i.e., by the differentiation state or in 
replicating or quiescent cells) or by exogenously added factors, for example. 
Introduction of the molecules (as plasmids or viruses) into the host cell may
also be accomplished using techniques known to the skilled artisan and as
discussed throughout the specification. In a preferred embodiment, standard
transfection techniques are used, e.g., CaPO4 transfection or electroporation.

Assembly of the selected DNA sequences of the adenovirus (as well as the
transgene and other vector elements) into various intermediate plasmids, and
the use of the plasmids and vectors to produce a recombinant viral particle are
all achieved using conventional techniques. Such techniques include direct
cloning as described [G. Gao et al., Gene Ther. 2003 Oct; 10(22):1926-1930; US
Patent Publication No. 2003-0092161-A, published May 15, 2003; International
Patent Application No. PCT/US03/12405]. Other cloning techniques of cDNA such as
those described in texts [Sambrook et al., cited above], use of overlapping
oligonucleotide sequences of the adenovirus genomes, polymerase chain reaction,
and any suitable method which provides the desired nucleotide sequence can be
utilized. Standard transfection and co-transfection techniques are employed,
e.g., CaPO4 precipitation techniques. Other conventional methods employed
include homologous recombination of the viral genomes, plaquing of viruses in
agar overlay, methods of measuring signal generation, and the like.

For example, following the construction and assembly of the desired
minigene-containing viral vector, the vector is transfected in vitro in the
presence of an optional helper vector into the packaging cell line. The
functions expressed from the plasmid, packaging cell line and helper virus, if
any, permit the adenovirus-transgene sequences in the vector to be replicated
and packaged into virion capsids, resulting in the chimeric viral particles.
The current method for producing such virus particles is transfection-based.
However, the invention is not limited to such methods. The resulting chimeric
adenoviruses are useful in transferring a selected transgene to a selected cell.

III. Use of the Chimeric Adenovirus Vectors 

The chimeric adenovirus vectors of the invention are useful for gene transfer 
to a human or veterinary subject (including, non-human primates, non-simian 
primates, and other mammals) in vitro, ex vivo, and in vivo. 

The recombinant adenovirus vectors described herein can be used as expression
vectors for the production of the products encoded by the heterologous genes in
vitro. For example, the recombinant adenoviruses containing a gene inserted
into the location of an E1 deletion may be transfected into an E1-expressing
cell line as described above. Alternatively, replication-competent adenoviruses
may be used in another selected cell line. The transfected cells are then
cultured in the conventional manner, allowing the recombinant adenovirus to
express the gene product from the promoter. The gene product may then be
recovered from the culture medium by known conventional methods of protein
isolation and recovery from culture.

A chimeric adenoviral vector of the invention provides an efficient gene
transfer vehicle that can deliver a selected transgene to a selected host cell
in vivo or ex vivo even where the organism has neutralizing antibodies to one or
more AAV serotypes. In one embodiment, the rAAV and the cells are mixed ex vivo;
the infected cells are cultured using conventional methodologies; and the
transduced cells are re-infused into the patient. These compositions are
particularly well suited to gene delivery for therapeutic purposes and for
immunization, including inducing protective immunity.

More commonly, the chimeric adenoviral vectors of the invention will be
utilized for delivery of therapeutic or immunogenic molecules, as described
below. It will be readily understood for both applications that the recombinant
adenoviral vectors of the invention are particularly well suited for use in
regimens involving repeat delivery of recombinant adenoviral vectors. Such
regimens typically involve delivery of a series of viral vectors in which the
viral capsids are alternated. The viral capsids may be changed for each
subsequent administration, or after a pre-selected number of administrations of
a particular serotype capsid (e.g., one, two, three, four or more). Thus, a
regimen may involve delivery of a rAd with a first capsid, delivery with a rAd
with a second capsid, and delivery with a third capsid. A variety of other
regimens which use the Ad capsids of the invention alone, in combination with
one another, or in combination with other Ad serotypes will be apparent to those
of skill in the art. Optionally, such a regimen may involve administration of
rAd with capsids of non-human primate adenoviruses, human adenoviruses, or
artificial (e.g., chimeric) serotypes such as are described herein. Each phase
of the regimen may involve administration of a series of injections (or other
delivery routes) with a single Ad serotype capsid followed by a series with
another Ad serotype capsid. Alternatively, the recombinant Ad vectors of the
invention may be utilized in regimens involving other non-adenoviral-mediated
delivery systems, including other viral systems, non-viral delivery systems,
protein, peptides, and other biologically active molecules.
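
A repeat-delivery regimen in which the capsid is changed after a pre-selected number of administrations can be sketched as a simple schedule generator. This is an illustrative sketch only: the function name and capsid labels are hypothetical, and cycling back to the first capsid once the list is exhausted is an assumption made for the example rather than a requirement of the regimens described above.

```python
from itertools import cycle


def capsid_schedule(capsids, doses_per_capsid, total_doses):
    """Return the capsid used at each administration, switching to the next
    capsid after `doses_per_capsid` administrations.

    Assumption: when every capsid has been used, the rotation starts again
    from the first one.
    """
    if not capsids:
        raise ValueError("at least one capsid is required")
    if doses_per_capsid < 1:
        raise ValueError("doses_per_capsid must be at least 1")
    rotation = cycle(capsids)
    current = next(rotation)
    schedule = []
    for dose_index in range(total_doses):
        # Switch capsid at the start of each new block of administrations.
        if dose_index and dose_index % doses_per_capsid == 0:
            current = next(rotation)
        schedule.append(current)
    return schedule


if __name__ == "__main__":
    # First, second and third capsid, one administration each.
    print(capsid_schedule(["first", "second", "third"], 1, 3))
    # Two administrations per capsid before switching.
    print(capsid_schedule(["A", "B"], 2, 5))
```

Setting `doses_per_capsid` to 1 gives the case in which the capsid is changed for each subsequent administration.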

The following sections will focus on exemplary molecules which may be
delivered via the adenoviral vectors of the invention.

A. Ad-Mediated Delivery of Therapeutic Molecules

In one embodiment, the Ad vectors described herein are administered 
to humans according to published methods for gene therapy. A viral vector of the 
invention bearing the selected transgene may be administered to a patient, preferably 
suspended in a biologically compatible solution or pharmaceutically acceptable 
delivery vehicle. A suitable vehicle includes sterile saline. Other aqueous and non- 
aqueous isotonic sterile injection solutions and aqueous and non-aqueous sterile 
suspensions known to be pharmaceutically acceptable carriers and well known to 
those of skill in the art may be employed for this purpose. 

The adenoviral vectors are administered in sufficient amounts to transduce the
target cells and to provide sufficient levels of gene transfer and expression to
provide a therapeutic benefit without undue adverse or with medically
acceptable physiological effects, which can be determined by those skilled in
the medical arts. Conventional and pharmaceutically acceptable routes of
administration include, but are not limited to, direct delivery to the retina
and other intraocular delivery methods, direct delivery to the liver,
inhalation, intranasal, intravenous, intramuscular, intratracheal,
subcutaneous, intradermal, rectal, oral and other parenteral routes of
administration. Routes of administration may be combined, if desired, or
adjusted depending upon the transgene or the condition. The route of
administration primarily will depend on the nature of the condition being
treated.

Dosages of the viral vector will depend primarily on factors such as the
condition being treated, the age, weight and health of the patient, and may thus
vary among patients. For example, a therapeutically effective adult human or
veterinary dosage of the viral vector is generally in the range of from about
100 µL to about 100 mL of a carrier containing concentrations of from about
1 x 10^ to about 1 x 10*^ particles, about 1 x 10" to 1 x 10" particles, or
about 1 x 10' to 1 x 10^^ particles. Dosages will range depending upon the size
of the animal and the route of administration. For example, a suitable human or
veterinary dosage (for about an 80 kg animal) for intramuscular injection is in
the range of about 1 x 10^ to about 5 x 10'^ particles per mL, for a single
site. Optionally, multiple sites of administration may be delivered. In another
example, a suitable human or veterinary dosage may be in the range of about
1 x 10" to about 1 x 10'^ particles for an oral formulation. One of skill in the
art may adjust these doses, depending upon the route of administration, and the
therapeutic or vaccinal application for which the recombinant vector is
employed. The levels of expression of the transgene, or for an immunogen, the
level of circulating antibody, can be monitored to determine the frequency of
dosage administration. Yet other methods for determining the timing of frequency
of administration will be readily apparent to one of skill in the art.
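
The relationship between carrier volume, particle concentration and delivered dose described above is a simple product, which the following sketch illustrates. The function names are hypothetical, and the concentration, volume and number of sites used in the example are placeholder values chosen only to show the arithmetic; they are not dosages taken from this specification. The volume check uses the stated carrier range of about 100 µL to about 100 mL.

```python
# Stated carrier volume range from the text, expressed in mL (100 uL = 0.1 mL).
MIN_VOLUME_ML = 0.1
MAX_VOLUME_ML = 100.0


def total_particles(concentration_per_ml: float, volume_ml: float) -> float:
    """Total particles delivered = concentration (particles/mL) x volume (mL)."""
    if concentration_per_ml < 0 or volume_ml < 0:
        raise ValueError("concentration and volume must be non-negative")
    return concentration_per_ml * volume_ml


def volume_in_stated_range(volume_ml: float) -> bool:
    """True if the carrier volume lies within about 100 uL to about 100 mL."""
    return MIN_VOLUME_ML <= volume_ml <= MAX_VOLUME_ML


def particles_per_site(total: float, n_sites: int) -> float:
    """Split a total dose evenly across several administration sites."""
    if n_sites < 1:
        raise ValueError("n_sites must be at least 1")
    return total / n_sites


if __name__ == "__main__":
    # Placeholder values for illustration only, not a recommended dose.
    dose = total_particles(concentration_per_ml=2e10, volume_ml=0.5)
    print(dose, volume_in_stated_range(0.5), particles_per_site(dose, 4))
```

Splitting the dose evenly across sites is a design simplification; the text only states that multiple sites of administration may optionally be used.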

An optional method step involves the co-administration to the patient, either
concurrently with, or before or after administration of the viral vector, of a
suitable amount of a short acting immune modulator. The selected immune
modulator is defined herein as an agent capable of inhibiting the formation of
neutralizing antibodies directed against the recombinant vector of this
invention or capable of inhibiting cytolytic T lymphocyte (CTL) elimination of
the vector. The immune modulator may interfere with the interactions between the
T helper subsets (Th1 or Th2) and B cells to inhibit neutralizing antibody
formation. Alternatively, the immune modulator may inhibit the interaction
between Th1 cells and CTLs to reduce the occurrence of CTL elimination of the
vector. A variety of useful immune modulators and dosages for use of same are
disclosed, for example, in Yang et al., J. Virol., 70(9) (Sept 1996);
International Patent Application No. WO96/12406, published May 2, 1996; and
International Patent Application No. PCT/US96/03035, all incorporated herein by
reference. Typically, such immune modulators would be selected when the
transgene is a therapeutic which requires repeat delivery.
1. Therapeutic Transgenes 

Useful therapeutic products encoded by the transgene include hormones and
growth and differentiation factors including, without limitation, insulin,
glucagon, growth hormone (GH), parathyroid hormone (PTH), growth hormone
releasing factor (GRF), follicle stimulating hormone (FSH), luteinizing hormone
(LH), human chorionic gonadotropin (hCG), vascular endothelial growth factor
(VEGF), angiopoietins, angiostatin, granulocyte colony stimulating factor
(GCSF), erythropoietin (EPO), connective tissue growth factor (CTGF), basic
fibroblast growth factor (bFGF), acidic fibroblast growth factor (aFGF),
epidermal growth factor (EGF), transforming growth factor α (TGFα),
platelet-derived growth factor (PDGF), insulin growth factors I and II (IGF-I
and IGF-II), any one of the transforming growth factor superfamily, including
TGFβ, activins, inhibins, or any of the bone morphogenic proteins (BMP) BMPs
1-15, any one of the heregulin/neuregulin/ARIA/neu differentiation factor (NDF)
family of growth factors, nerve growth factor (NGF), brain-derived neurotrophic
factor (BDNF), neurotrophins NT-3 and NT-4/5, ciliary neurotrophic factor
(CNTF), glial cell line derived neurotrophic factor (GDNF), neurturin, agrin,
any one of the family of semaphorins/collapsins, netrin-1 and netrin-2,
hepatocyte growth factor (HGF), ephrins, noggin, sonic hedgehog and tyrosine
hydroxylase.

Other useful transgene products include proteins that regulate the immune
system including, without limitation, cytokines and lymphokines such as
thrombopoietin (TPO), interleukins (IL) IL-1 through IL-25 (including, e.g.,
IL-2, IL-4, IL-12 and IL-18), monocyte chemoattractant protein, leukemia
inhibitory factor, granulocyte-macrophage colony stimulating factor, Fas ligand,
tumor necrosis factors α and β, interferons α, β and γ, stem cell factor,
flk-2/flt3 ligand. Gene products produced by the immune system are also useful
in the invention. These include, without limitation, immunoglobulins IgG, IgM,
IgA, IgD and IgE, chimeric immunoglobulins, humanized antibodies, single chain
antibodies, T cell receptors, chimeric T cell receptors, single chain T cell
receptors, class I and class II MHC molecules, as well as engineered
immunoglobulins and MHC molecules. Useful gene products also include complement
regulatory proteins such as membrane cofactor protein (MCP), decay accelerating
factor (DAF), CR1, CF2 and CD59.

Still other useful gene products include any one of the receptors for the
hormones, growth factors, cytokines, lymphokines, regulatory proteins and immune
system proteins. The invention encompasses receptors for cholesterol
regulation, including the low density lipoprotein (LDL) receptor, high density
lipoprotein (HDL) receptor, the very low density lipoprotein (VLDL) receptor,
proteins useful in the regulation of lipids, including, e.g., apolipoprotein
(apo) A and its isoforms (e.g., ApoAI), apoE and its isoforms (including E2, E3
and E4), SRB1, ABC1, and the scavenger receptor. The invention also encompasses
gene products such as members of the steroid hormone receptor superfamily
including glucocorticoid receptors and estrogen receptors, Vitamin D receptors
and other nuclear receptors. In addition, useful gene products include
transcription factors such as jun, fos, max, mad, serum response factor (SRF),
AP-1, AP2, myb, MyoD and myogenin, ETS-box containing proteins, TFE3, E2F, ATF1,
ATF2, ATF3, ATF4, ZF5, NFAT, CREB, HNF-4, C/EBP, SP1, CCAAT-box binding
proteins, interferon regulation factor (IRF-1), Wilms tumor protein, ETS-binding
protein, STAT, GATA-box binding proteins, e.g., GATA-3, and the forkhead family
of winged helix proteins.

Other useful gene products include carbamoyl synthetase I, ornithine
transcarbamylase, arginosuccinate synthetase, arginosuccinate lyase, arginase,
fumarylacetacetate hydrolase, phenylalanine hydroxylase, alpha-1 antitrypsin,
glucose-6-phosphatase, porphobilinogen deaminase, cystathione beta-synthase,
branched chain ketoacid decarboxylase, albumin, isovaleryl-coA dehydrogenase,
propionyl CoA carboxylase, methyl malonyl CoA mutase, glutaryl CoA
dehydrogenase, insulin, beta-glucosidase, pyruvate carboxylate, hepatic
phosphorylase, phosphorylase kinase, glycine decarboxylase, H-protein,
T-protein, a cystic fibrosis transmembrane regulator (CFTR) sequence, and a
dystrophin cDNA sequence. Other useful gene products include those useful for
treatment of hemophilia A (e.g., Factor VIII and its variants, including the
light chain and heavy chain of the heterodimer, optionally operably linked by a
junction, and the B-domain deleted Factor VIII) [see US 6,200,560 and
6,221,349], and useful for treatment of hemophilia B (e.g., Factor IX).

Still other useful gene products include non-naturally occurring 
polypeptides, such as chimeric or hybrid polypeptides having a non-naturally 
occurring amino acid sequence containing insertions, deletions or amino acid 
substitutions. For example, single-chain engineered immunoglobulins could be useful 
in certain immunocompromised patients. Other types of non-naturally occurring gene 
sequences include antisense molecules and catalytic nucleic acids, such as ribozymes,
which could be used to reduce overexpression of a target.

Reduction and/or modulation of expression of a gene are particularly 
desirable for treatment of hyperproliferative conditions characterized by
hyperproliferating cells, as are cancers and psoriasis. Target polypeptides include
those polypeptides which are produced exclusively or at higher levels in
hyperproliferative cells as compared to normal cells. Target antigens include
polypeptides encoded by oncogenes such as myb, myc, fyn, and the translocation
gene bcr/abl, ras, src, P53, neu, trk and EGFR. In addition to oncogene products as
target antigens, target polypeptides for anti-cancer treatments and protective regimens
include variable regions of antibodies made by B cell lymphomas and variable
regions of T cell receptors of T cell lymphomas which, in some embodiments, are
also used as target antigens for autoimmune disease. Other tumor-associated
polypeptides can be used as target polypeptides such as polypeptides which are found
at higher levels in tumor cells including the polypeptide recognized by monoclonal
antibody 17-1A and folate binding polypeptides.

Other suitable therapeutic polypeptides and proteins include those 
which may be useful for treating individuals suffering from autoimmune diseases and
disorders by conferring a broad based protective immune response against targets that
are associated with autoimmunity including cell receptors and cells which produce
self-directed antibodies. T-cell mediated autoimmune diseases include rheumatoid
arthritis (RA), multiple sclerosis (MS), Sjogren's syndrome, sarcoidosis, insulin
dependent diabetes mellitus (IDDM), autoimmune thyroiditis, reactive arthritis,
ankylosing spondylitis, scleroderma, polymyositis, dermatomyositis, psoriasis,
vasculitis, Wegener's granulomatosis, Crohn's disease and ulcerative colitis. Each of
these diseases is characterized by T cell receptors (TCRs) that bind to endogenous
antigens and initiate the inflammatory cascade associated with autoimmune diseases.

The chimeric adenoviral vectors of the invention are particularly well
suited for therapeutic regimens in which multiple adenoviral-mediated deliveries of
transgenes are desired, e.g., in regimens involving redelivery of the same transgene or
in combination regimens involving delivery of other transgenes. Such regimens may
involve administration of a chimeric adenoviral vector, followed by re-administration
with a vector from the same serotype adenovirus. Particularly desirable regimens
involve administration of a chimeric adenoviral vector of the invention, in which the
serotype of the viral vector delivered in the first administration differs from the
serotype of the viral vector utilized in one or more of the subsequent administrations.
For example, a therapeutic regimen involves administration of a chimeric vector and
repeat administration with one or more adenoviral vectors of the same or different
serotypes. In another example, a therapeutic regimen involves administration of an
adenoviral vector followed by repeat administration with a chimeric vector of the
invention which differs from the serotype of the first delivered adenoviral vector, and
optionally further administration with another vector which is the same or, preferably,
differs from the serotype of the vector in prior administration steps. These
regimens are not limited to delivery of adenoviral vectors constructed using the
chimeric serotypes of the invention. Rather, these regimens can readily utilize
chimeric or non-chimeric vectors of other adenoviral serotypes, which may be of
[illegible], human or non-human primate, or other mammalian sources, in combination
with one or more of the chimeric vectors of the invention. Examples of such
regimens are discussed elsewhere in this document. Further, these therapeutic
regimens may involve either simultaneous or sequential delivery of chimeric
adenoviral vectors of the invention in combination with non-adenoviral vectors,
non-viral vectors, and/or a variety of other therapeutically useful compounds.
The present invention is not limited to these therapeutic regimens, a variety of which
will be readily apparent to one of skill in the art.

B. Ad-Mediated Delivery of Immunogenic Transgenes

The adenoviruses of the invention may also be employed as
immunogenic compositions. As used herein, an immunogenic composition is a
composition to which a humoral (e.g., antibody) or cellular (e.g., a cytotoxic T cell)
response is mounted to a transgene product delivered by the immunogenic
composition following delivery to a mammal, and preferably a primate. The present
invention provides an Ad that can contain in any of its adenovirus sequence deletions a
gene encoding a desired immunogen. Chimeric adenoviruses based on simian or
other non-human mammalian primate serotypes are likely to be better suited for use
as a live recombinant virus vaccine in different animal species compared to an
adenovirus of human origin, but are not limited to such a use. The recombinant
adenoviruses can be used as prophylactic or therapeutic vaccines against any
pathogen for which the antigen(s) crucial for induction of an immune response and
able to limit the spread of the pathogen has been identified.

Such vaccinal (or other immunogenic) compositions are formulated in
a suitable delivery vehicle, as described above. Generally, doses for the
immunogenic compositions are in the range defined above for therapeutic
compositions. The levels of immunity of the selected gene can be monitored to
determine the need, if any, for boosters. Following an assessment of antibody titers in
the serum, optional booster immunizations may be desired.

Optionally, a vaccinal composition of the invention may be formulated
to contain other components, including, e.g., adjuvants, stabilizers, pH adjusters,
preservatives and the like. Such components are well known to those of skill in the
vaccine art. Examples of suitable adjuvants include, without limitation, liposomes,
alum, monophosphoryl lipid A, and any biologically active factor, such as a cytokine,
an interleukin, a chemokine, a ligand, and optimally combinations thereof. Certain of
these biologically active factors can be expressed in vivo, e.g., via a plasmid or viral 
vector. For example, such an adjuvant can be administered with a priming DNA 
vaccine encoding an antigen to enhance the antigen-specific immune response 
compared with the immune response generated upon priming with a DNA vaccine 
encoding the antigen only. 

The adenoviruses are administered in "an immunogenic amount", that 
is, an amount of adenovirus that is effective in a route of administration to transfect
the desired cells and provide sufficient levels of expression of the selected gene to
induce an immune response. Where protective immunity is provided, the
recombinant adenoviruses are considered to be vaccine compositions useful in
preventing infection and/or recurrent disease.

Alternatively, or in addition, the vectors of the invention may contain
a transgene encoding a peptide, polypeptide or protein which induces an immune
response to a selected immunogen. The recombinant adenoviruses of this invention
are expected to be highly efficacious at inducing cytolytic T cells and antibodies to
the inserted heterologous antigenic protein expressed by the vector.

For example, immunogens may be selected from a variety of viral 
families. Examples of desirable viral families against which an immune response
would be desirable include the picornavirus family, which includes the genera
rhinoviruses, which are responsible for about 50% of cases of the common cold; the
genera enteroviruses, which include polioviruses, coxsackieviruses, echoviruses, and
human enteroviruses such as hepatitis A virus; and the genera aphthoviruses, which are
responsible for foot and mouth diseases, primarily in non-human animals. Within the
picornavirus family of viruses, target antigens include the VP1, VP2, VP3, VP4, and
VPG. Another viral family includes the calicivirus family, which encompasses the
Norwalk group of viruses, which are an important causative agent of epidemic
gastroenteritis. Still another viral family desirable for use in targeting antigens for
inducing immune responses in humans and non-human animals is the togavirus
family, which includes the genera alphavirus, which include Sindbis viruses,
Ross River virus, and Venezuelan, Eastern & Western Equine encephalitis, and
rubivirus, including Rubella virus. The flaviviridae family includes dengue, yellow
fever, Japanese encephalitis, St. Louis encephalitis and tick borne encephalitis
viruses. Other target antigens may be generated from the Hepatitis C or the
coronavirus family, which includes a number of non-human viruses such as infectious
bronchitis virus (poultry), porcine transmissible gastroenteric virus (pig), porcine
hemagglutinating encephalomyelitis virus (pig), feline infectious peritonitis virus
(cats), feline enteric coronavirus (cat), canine coronavirus (dog), and human
respiratory coronaviruses, which may cause the common cold and/or non-A, B or C
hepatitis. In addition, the human coronaviruses include the putative causative agent
of severe acute respiratory syndrome (SARS). Within the coronavirus family, target
antigens include the E1 (also called M or matrix protein), E2 (also called S or Spike
protein), E3 (also called HE or hemagglutinin-esterase) glycoprotein (not present in all
coronaviruses), or N (nucleocapsid). Still other antigens may be targeted against the
rhabdovirus family, which includes the genera vesiculovirus (e.g., Vesicular
Stomatitis Virus), and the genus lyssavirus (e.g., rabies). Within the rhabdovirus
family, suitable antigens may be derived from the G protein or the N protein. The
family filoviridae, which includes hemorrhagic fever viruses such as Marburg and
Ebola virus, may be a suitable source of antigens. The paramyxovirus family
includes parainfluenza Virus Type 1, parainfluenza Virus Type 3, bovine
parainfluenza Virus Type 3, rubulavirus (mumps virus), parainfluenza Virus Type 2,
parainfluenza virus Type 4, Newcastle disease virus (chickens), rinderpest,
morbillivirus, which includes measles and canine distemper, and pneumovirus, which
includes respiratory syncytial virus. The influenza virus is classified within the
family orthomyxovirus and is a suitable source of antigen (e.g., the HA protein, the
M1 protein). The bunyavirus family includes the genera bunyavirus (California
encephalitis, La Crosse), phlebovirus (Rift Valley Fever), hantavirus (Puumala is a
hemorrhagic fever virus), nairovirus (Nairobi sheep disease) and various unassigned
bunyaviruses. The arenavirus family provides a source of antigens against LCM and
Lassa fever virus. The reovirus family includes the genera reovirus, rotavirus (which
causes acute gastroenteritis in children), orbiviruses, and coltivirus (Colorado Tick
fever), Lebombo (humans), equine encephalosis, blue tongue.

The retrovirus family includes the sub-family oncovirinae, which
encompasses such human and veterinary diseases as feline leukemia virus, HTLVI
and HTLVII, lentivirinae (which includes human immunodeficiency virus (HIV),
simian immunodeficiency virus (SIV), feline immunodeficiency virus (FIV), and equine
infectious anemia virus), and spumavirinae. Among the lentiviruses, many suitable
antigens have been described and can readily be selected. Examples of suitable HIV
and SIV antigens include, without limitation, the gag, pol, Vif, Vpx, VPR, Env, Tat,
Nef, and Rev proteins, as well as various fragments thereof. For example, suitable
fragments of the Env protein may include any of its subunits such as the gp120,
gp160, gp41, or smaller fragments thereof, e.g., of at least about 8 amino acids in
length. Similarly, fragments of the tat protein may be selected. [See, US Patent
5,891,994 and US Patent 6,193,981.] See, also, the HIV and SIV proteins described
in D.H. Barouch et al., J. Virol., 75(5):2462-2467 (March 2001), and R.R. Amara, et
al., Science, 292:69-74 (6 April 2001). In another example, the HIV and/or SIV
immunogenic proteins or peptides may be used to form fusion proteins or other
immunogenic molecules. See, e.g., the HIV-1 Tat and/or Nef fusion proteins and
immunization regimens described in WO 01/54719, published August 2, 2001, and
WO 99/16884, published April 8, 1999. The invention is not limited to the HIV
and/or SIV immunogenic proteins or peptides described herein. In addition, a variety
of modifications to these proteins have been described or could readily be made by
one of skill in the art. See, e.g., the modified gag protein that is described in US
Patent 5,972,596. Further, any desired HIV and/or SIV immunogens may be
delivered alone or in combination. Such combinations may include expression from a
single vector or from multiple vectors. Optionally, another combination may involve
delivery of one or more expressed immunogens with delivery of one or more of the
immunogens in protein form. Such combinations are discussed in more detail below.

The papovavirus family includes the sub-family polyomaviruses
(BKU and JCU viruses) and the sub-family papillomavirus (associated with cancers
or malignant progression of papilloma). The adenovirus family includes viruses (EX,
AD7, ARD, O.B.) which cause respiratory disease and/or enteritis. The parvovirus
family includes feline parvovirus (feline enteritis), feline panleucopeniavirus, canine
parvovirus, and porcine parvovirus. The herpesvirus family includes the sub-family
alphaherpesvirinae, which encompasses the genera simplexvirus (HSVI, HSVII) and
varicellovirus (pseudorabies, varicella zoster), and the sub-family betaherpesvirinae,
which includes the genera cytomegalovirus (HCMV, muromegalovirus), and the
sub-family gammaherpesvirinae, which includes the genera lymphocryptovirus, EBV
(Burkitts lymphoma), infectious rhinotracheitis, Marek's disease virus, and
rhadinovirus. The poxvirus family includes the sub-family chordopoxvirinae, which
encompasses the genera orthopoxvirus (Variola (Smallpox) and Vaccinia (Cowpox)),
parapoxvirus, avipoxvirus, capripoxvirus, leporipoxvirus, suipoxvirus, and the
sub-family entomopoxvirinae. The hepadnavirus family includes the Hepatitis B
virus. One unclassified virus which may be a suitable source of antigens is the
Hepatitis delta virus. Still other viral sources may include avian infectious bursal
disease virus and porcine respiratory and reproductive syndrome virus. The
alphavirus family includes equine arteritis virus and various Encephalitis viruses.

The viruses of the present invention may also carry immunogens
which are useful to immunize a human or non-human animal against other pathogens
including bacteria, fungi, parasitic microorganisms or multicellular parasites which
infect human and non-human vertebrates, or from a cancer cell or tumor cell.
Examples of bacterial pathogens include pathogenic gram-positive cocci, which include
pneumococci; staphylococci; and streptococci. Pathogenic gram-negative cocci
include meningococcus; gonococcus. Pathogenic enteric gram-negative bacilli
include enterobacteriaceae; pseudomonas, acinetobacteria and eikenella; melioidosis;
salmonella; shigella; haemophilus; moraxella; H. ducreyi (which causes chancroid);
brucella; Francisella tularensis (which causes tularemia); yersinia (pasteurella);
streptobacillus moniliformis and spirillum. Gram-positive bacilli include listeria
monocytogenes; erysipelothrix rhusiopathiae; Corynebacterium diphtheria
(diphtheria); cholera; B. anthracis (anthrax); donovanosis (granuloma inguinale); and
bartonellosis. Diseases caused by pathogenic anaerobic bacteria include tetanus;
botulism; other clostridia; tuberculosis; leprosy; and other mycobacteria. Pathogenic
spirochetal diseases include syphilis; treponematoses: yaws, pinta and endemic
syphilis; and leptospirosis. Other infections caused by higher pathogen bacteria and
pathogenic fungi include actinomycosis; nocardiosis; cryptococcosis, blastomycosis,
histoplasmosis and coccidioidomycosis; candidiasis, aspergillosis, and mucormycosis;
sporotrichosis; paracoccidiodomycosis, petriellidiosis, torulopsosis, mycetoma and
chromomycosis; and dermatophytosis. Rickettsial infections include Typhus fever,
Rocky Mountain spotted fever, Q fever, and Rickettsialpox. Examples of
mycoplasma and chlamydial infections include: mycoplasma pneumoniae;
lymphogranuloma venereum; psittacosis; and perinatal chlamydial infections.
Pathogenic eukaryotes encompass pathogenic protozoans and helminths and
infections produced thereby include: amebiasis; malaria; leishmaniasis;
trypanosomiasis; toxoplasmosis; Pneumocystis carinii; Trichans; Toxoplasma gondii;
babesiosis; giardiasis; trichinosis; filariasis; schistosomiasis; nematodes; trematodes
or flukes; and cestode (tapeworm) infections.

Many of these organisms and/or toxins produced thereby have been 
identified by the Centers for Disease Control [(CDC), Department of Health and
Human Services, USA], as agents which have potential for use in biological attacks.
For example, some of these biological agents include Bacillus anthracis (anthrax),
Clostridium botulinum and its toxin (botulism), Yersinia pestis (plague), variola major
(smallpox), Francisella tularensis (tularemia), and viral hemorrhagic fevers
(filoviruses [e.g., Ebola, Marburg] and arenaviruses [e.g., Lassa, Machupo]), all of
which are currently classified as Category A agents; Coxiella burnetii (Q fever),
Brucella species (brucellosis), Burkholderia mallei (glanders), Burkholderia
pseudomallei (melioidosis), Ricinus communis and its toxin (ricin toxin), Clostridium
perfringens and its toxin (epsilon toxin), Staphylococcus species and their toxins
(enterotoxin B), Chlamydia psittaci (psittacosis), water safety threats (e.g., Vibrio
cholerae, Cryptosporidium parvum), Typhus fever (Rickettsia prowazekii), and viral
encephalitis (alphaviruses, e.g., Venezuelan equine encephalitis; eastern equine
encephalitis; western equine encephalitis), all of which are currently classified as
Category B agents; and Nipah virus and hantaviruses, which are currently classified
as Category C agents. In addition, other organisms, which are so classified or
differently classified, may be identified and/or used for such a purpose in the future.
It will be readily understood that the viral vectors and other constructs described
herein are useful to deliver antigens from these organisms, viruses, their toxins or
other byproducts, which will prevent and/or treat infection or other adverse reactions
with these biological agents.

Administration of the vectors of the invention to deliver immunogens
against the variable region of the T cells elicits an immune response including CTLs to
eliminate those T cells. In RA, several specific variable regions of TCRs which are
involved in the disease have been characterized. These TCRs include V-3, V-14,
V-17 and Vα-17. Thus, delivery of a nucleic acid sequence that encodes at least one
of these polypeptides will elicit an immune response that will target T cells involved
in RA. In MS, several specific variable regions of TCRs which are involved in the
disease have been characterized. These TCRs include V-7 and Vα-10. Thus, delivery
of a nucleic acid sequence that encodes at least one of these polypeptides will elicit an
immune response that will target T cells involved in MS. In scleroderma, several
specific variable regions of TCRs which are involved in the disease have been
characterized. These TCRs include V-6, V-8, V-14 and Vα-16, Vα-3C, Vα-7, Vα-14,
Vα-15, Vα-16, Vα-28 and Vα-12. Thus, delivery of a chimeric adenovirus that
encodes at least one of these polypeptides will elicit an immune response that will
target T cells involved in scleroderma.

C. Ad-Mediated Delivery Methods

The therapeutic levels, or levels of immunity, of the selected gene can
be monitored to determine the need, if any, for boosters. Following an assessment of
CD8+ T cell response, or optionally, antibody titers, in the serum, optional booster
immunizations may be desired. Optionally, the adenoviral vectors of the invention
may be delivered in a single administration or in various combination regimens, e.g.,
in combination with a regimen or course of treatment involving other active
ingredients or in a prime-boost regimen. A variety of such regimens have been 
described in the art and may be readily selected. 

For example, prime-boost regimens may involve the administration of 
a DNA (e.g., plasmid) based vector to prime the immune system to a second or
further, booster, administration with a traditional antigen, such as a protein or a
recombinant virus carrying the sequences encoding such an antigen. See, e.g., WO
00/11140, published March 2, 2000, incorporated by reference. Alternatively, an
immunization regimen may involve the administration of a chimeric adenoviral vector
of the invention to boost the immune response to a vector (either viral or DNA-based)
carrying an antigen, or a protein. In still another alternative, an immunization
regimen involves administration of a protein followed by booster with a vector
encoding the antigen.

In one embodiment, the invention provides a method of priming and 
boosting an immune response to a selected antigen by delivering a plasmid DNA 
vector carrying said antigen, followed by boosting with an adenoviral vector of the
invention. In one embodiment, the prime-boost regimen involves the expression of
multiproteins from the prime and/or the boost vehicle. See, e.g., R.R. Amara,
Science, 292:69-74 (6 April 2001) which describes a multiprotein regimen for
expression of protein subunits useful for generating an immune response against HIV
and SIV. For example, a DNA prime may deliver the Gag, Pol, Vif, VPX and Vpr
and Env, Tat, and Rev from a single transcript. Alternatively, the SIV Gag, Pol and
HIV-1 Env are delivered in a recombinant adenovirus construct of the invention. Still
other regimens are described in WO 99/16884 and WO 01/54719.

However, the prime-boost regimens are not limited to immunization 
for HIV or to delivery of these antigens. For example, priming may involve 
delivering with a first vector of the invention followed by boosting with a second 
vector, or with a composition containing the antigen itself in protein form. In one 
example, the prime-boost regimen can provide a protective immune response to the 
virus, bacteria or other organism from which the antigen is derived. In another 
desired embodiment, the prime-boost regimen provides a therapeutic effect that can
be measured using conventional assays for detection of the presence of the condition
for which therapy is being administered. 

The priming composition may be administered at various sites in the 
body in a dose dependent manner, which depends on the antigen to which the desired 
immune response is being targeted. The invention is not limited to the amount or
situs of injection(s) or to the pharmaceutical carrier. Rather, the regimen may
involve a priming and/or boosting step, each of which may include a single dose or
dosage that is administered hourly, daily, weekly or monthly, or yearly. As an
example, the mammals may receive one or two doses containing between about 10 µg
to about 50 µg of plasmid in carrier. A desirable amount of a DNA composition
ranges between about 1 µg to about 10,000 µg of the DNA vector. Dosages may vary
from about 1 µg to 1000 µg DNA per kg of subject body weight. The amount or site
of delivery is desirably selected based upon the identity and condition of the mammal.
The dosage unit of the vector suitable for delivery of the antigen to the mammal is
described herein. The vector is prepared for administration by being suspended or
dissolved in a pharmaceutically or physiologically acceptable carrier such as isotonic
saline, isotonic salts solution or other formulations that will be apparent to those
skilled in such administration. The appropriate carrier will be evident to those skilled 
in the art and will depend in large part upon the route of administration. The 
compositions of the invention may be administered to a mammal according to the 
routes described above, in a sustained release formulation using a biodegradable 
biocompatible polymer, or by on-site delivery using micelles, gels and liposomes. 
Optionally, the priming step of this invention also includes administering with the 
priming composition, a suitable amount of an adjuvant, such as are defined herein. 
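
The per-kilogram range stated above for the DNA priming composition can be illustrated with a short calculation. The following Python sketch is not part of the specification and is offered only as arithmetic illustration: it assumes the stated amounts are in micrograms, and it assumes (as one possible reading) that the per-kilogram range is capped by the overall range given for a DNA composition. The function name `dna_dose_window_ug` and the constants are hypothetical.

```python
# Illustrative arithmetic only. Ranges are taken from the text above and
# read as micrograms (an assumption); names are hypothetical.
PER_KG_RANGE_UG = (1.0, 1000.0)    # about 1 to 1000 micrograms DNA per kg
TOTAL_RANGE_UG = (1.0, 10000.0)    # about 1 to 10,000 micrograms per composition


def dna_dose_window_ug(body_weight_kg):
    """Return (low, high) DNA amounts in micrograms for a body weight in kg.

    The per-kg range is scaled by body weight and then clipped to the
    overall composition range; the clipping is an assumed interpretation.
    """
    if body_weight_kg <= 0:
        raise ValueError("body weight must be positive")
    low = max(PER_KG_RANGE_UG[0] * body_weight_kg, TOTAL_RANGE_UG[0])
    high = min(PER_KG_RANGE_UG[1] * body_weight_kg, TOTAL_RANGE_UG[1])
    return low, high


if __name__ == "__main__":
    # Example subject weights chosen arbitrarily for illustration.
    for weight in (2.0, 70.0):
        print(weight, dna_dose_window_ug(weight))
```

Under these assumptions the upper bound is set by the per-kilogram figure for small subjects and by the overall composition range for larger ones.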

Preferably, a boosting composition is administered about 2 to about 
27 weeks after administering the priming composition to the mammalian subject. The 
administration of the boosting composition is accomplished using an effective amount 
of a boosting composition containing or capable of delivering the same antigen as 
administered by the priming DNA vaccine. The boosting composition may be 
composed of a recombinant viral vector derived from the same viral source (e.g.,
adenoviral sequences of the invention) or from another source. Alternatively, the
"boosting composition" can be a composition containing the same antigen as encoded
in the priming DNA vaccine, but in the form of a protein or peptide, which 
composition induces an immune response in the host. In another embodiment, the 
boosting composition contains a DNA sequence encoding the antigen under the 
control of a regulatory sequence directing its expression in a mammalian cell, e.g., 
vectors such as well-known bacterial or viral vectors. The primary requirements of 
the boosting composition are that the antigen of the composition is the same antigen, 
or a cross-reactive antigen, as that encoded by the priming composition. 
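
The timing stated above, a boost about 2 to about 27 weeks after the prime, can be expressed as a simple date window. The following Python sketch is illustrative only; the function name `boost_window` and the example priming date are hypothetical and do not come from the specification.

```python
from datetime import date, timedelta


def boost_window(prime_date, min_weeks=2, max_weeks=27):
    """Return (earliest, latest) calendar dates for the boosting step.

    The defaults follow the "about 2 to about 27 weeks" interval in the
    text; treating "about" as exact week counts is a simplifying assumption.
    """
    if min_weeks > max_weeks:
        raise ValueError("min_weeks must not exceed max_weeks")
    earliest = prime_date + timedelta(weeks=min_weeks)
    latest = prime_date + timedelta(weeks=max_weeks)
    return earliest, latest


if __name__ == "__main__":
    # Hypothetical priming date used only to show the calculation.
    start, end = boost_window(date(2024, 1, 1))
    print(start.isoformat(), end.isoformat())
```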

In another embodiment, the adenoviral vectors of the invention are
also well suited for use in a variety of other immunization and therapeutic regimens.
Such regimens may involve delivery of adenoviral vectors of the invention
simultaneously or sequentially with Ad vectors of different serotype capsids,
regimens in which adenoviral vectors of the invention are delivered simultaneously or
sequentially with non-Ad vectors, regimens in which the adenoviral vectors of the
invention are delivered simultaneously or sequentially with proteins, peptides, and/or
other biologically useful therapeutic or immunogenic compounds. Such uses will be
readily apparent to one of skill in the art.

IV. Simian Adenovirus 18 Sequences 

The invention provides nucleic acid sequences and amino acid sequences of 
Ad SA18, which are isolated from the other viral material with which they are
associated in nature. These sequences are useful in preparing heterologous molecules
containing the nucleic acid sequences and amino acid sequences, and regions or
fragments thereof as are described herein, viral vectors which are useful for a variety
of purposes, including the constructs and compositions, and such methods as are
described herein for the chimeric adenoviruses, including, e.g., in host cells for
production of viruses requiring adenoviral helper functions, as delivery vehicles for
heterologous molecules such as those described herein. These sequences are also
useful in generating the chimeric adenoviruses of the invention. 

A. Nucleic Acid Sequences 

The SA18 nucleic acid sequences of the invention include nucleotides 
SEQ ID NO: 12, nt 1 to 31967. See, Sequence Listing, which is incorporated by
reference herein. The nucleic acid sequences of the invention further encompass the
strand which is complementary to the sequences of SEQ ID NO: 12, as well as the
RNA and cDNA sequences corresponding to these sequences
and their complementary strands. Further included in this invention are nucleic acid
sequences which are greater than 95 to 98%, and more preferably about 99 to 99.9%
homologous or identical to the Sequence Listing. Also included in the nucleic acid
sequences of the invention are natural variants and engineered modifications of the
sequences provided in SEQ ID NO: 12 and their complementary strands. Such
modifications include, for example, labels that are known in the art, methylation, and
substitution of one or more of the naturally occurring nucleotides with a degenerate 
nucleotide. 
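
Two notions used in the preceding paragraph, the complementary strand and percent identity between sequences, can be illustrated with a minimal Python sketch. It is not the method used to characterize SEQ ID NO: 12: the function names are hypothetical, the short sequences in the usage example are invented placeholders, and the identity calculation assumes two already-aligned sequences of equal length with no gaps, which is a simplification of how identity is normally computed.

```python
# Translation table for unambiguous DNA bases; other symbols pass through.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq):
    """Return the complementary strand, written 5' to 3'."""
    return seq.translate(_COMPLEMENT)[::-1]


def percent_identity(seq_a, seq_b):
    """Percent of matching positions between two equal-length sequences.

    Assumes the inputs are pre-aligned and gap-free (a simplification).
    """
    if len(seq_a) != len(seq_b) or not seq_a:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(x == y for x, y in zip(seq_a.upper(), seq_b.upper()))
    return 100.0 * matches / len(seq_a)


if __name__ == "__main__":
    # Placeholder sequences, not taken from the Sequence Listing.
    print(reverse_complement("ATGC"))
    print(percent_identity("ACGTACGTAC", "ACGTACGTAA"))
```

A sequence falling within the stated ranges would score above 95 under such a measure when compared position by position with the reference.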

The invention further encompasses fragments of the sequences of 
SA18, their complementary strand, cDNA and RNA complementary thereto. Suitable
fragments are at least 15 nucleotides in length, and encompass functional fragments,
i.e., fragments which are of biological interest. For example, a functional fragment
can express a desired adenoviral product or may be useful in production of
recombinant viral vectors. Such fragments include the gene sequences and fragments
listed in the tables below.

The following tables provide the transcript regions and open reading 
frames in the simian adenovirus sequences of the invention. For certain genes, the 
transcripts and open reading frames (ORFs) are located on the strand complementary 
to that presented in SEQ ID NO: 12. See, e.g., E2b, E4 and E2a. The calculated
molecular weights of the encoded proteins are also shown.

The SA18 adenoviral nucleic acid sequences are useful as therapeutic
and immunogenic agents and in construction of a variety of vector systems and host
cells. Such vectors are useful for any of the purposes described above for the
chimeric adenovirus. Additionally, these SA18 sequences and products may be used
alone or in combination with other adenoviral sequences or fragments, or in
combination with elements from other adenoviral or non-adenoviral sequences. The
adenoviral sequences of the invention are also useful as antisense delivery vectors,
gene therapy vectors, or vaccine vectors, and in methods of using same. Thus, the
invention further provides nucleic acid molecules, gene delivery vectors, and host
cells which contain the Ad sequences of the invention.

For example, the invention encompasses a nucleic acid molecule
containing simian Ad ITR sequences of the invention. In another example, the
invention provides a nucleic acid molecule containing simian Ad sequences of the
invention encoding a desired Ad gene product. Still other nucleic acid molecules
constructed using the sequences of the invention will be readily apparent to one of
skill in the art, in view of the information provided herein.

The simian Ad gene regions identified herein may
be used in a variety of vectors for delivery of a heterologous molecule to a cell.
Examples of such molecules and methods of delivery are provided in Section III
herein. For example, vectors are generated for expression of an adenoviral capsid
protein (or fragment thereof) for purposes of generating a viral vector in a packaging
host cell. Such vectors may be designed for expression in trans. Alternatively, such
vectors are designed to provide cells which stably contain sequences which express
desired adenoviral functions, e.g., one or more of E1a, E1b, the terminal repeat
sequences, E2a, E2b, E4, E4ORF6 region.

In addition, the adenoviral gene sequences and fragments thereof are
useful for providing the helper functions necessary for production of helper-dependent
viruses (e.g., adenoviral vectors deleted of essential functions or adeno-associated
viruses (AAV)). For such production methods, the simian adenoviral
sequences of the invention are utilized in such a method in a manner similar to those
described for the human Ad. However, due to the differences in sequences between
the simian adenoviral sequences of the invention and those of human Ad, use of
the sequences of the invention essentially eliminates the possibility of homologous
recombination with helper functions in a host cell carrying human Ad E1 functions,
e.g., 293 cells, which may produce infectious adenoviral contaminants during rAAV
production.

Methods of producing rAAV using adenoviral helper functions have
been described at length in the literature with human adenoviral serotypes. See, e.g.,
US Patent 6,258,595 and the references cited therein. See, also, US Patent 5,871,982;
WO 99/14354; WO 99/15685; WO 99/47691. These methods may also be used in
production of non-human serotype AAV, including non-human primate AAV
serotypes. The simian adenoviral gene sequences of the invention which provide the
necessary helper functions (e.g., E1a, E1b, E2a and/or E4 ORF6) can be particularly
useful in providing the necessary adenoviral function while minimizing or eliminating
the possibility of recombination with any other adenoviruses present in the
rAAV-packaging cell which are typically of human origin. Thus, selected genes or open
reading frames of the adenoviral sequences of the invention may be utilized in these
rAAV production methods.

Alternatively, recombinant adenoviral simian vectors of the invention 
may be utilized in these methods. Such recombinant adenoviral simian vectors may 
include, e.g., a hybrid simian Ad/AAV in which simian Ad sequences flank a rAAV 
expression cassette composed of, e.g., AAV 3' and/or 5' ITRs and a transgene under
the control of regulatory sequences which control its expression. One of skill in the 
art will recognize that still other simian adenoviral vectors and/or gene sequences of 
the invention will be useful for production of rAAV and other viruses dependent upon 
adenoviral helper. 

In still another embodiment, nucleic acid molecules are designed for 
delivery and expression of selected adenoviral gene products in a host cell to achieve 
a desired physiologic effect. For example, a nucleic acid molecule containing 
sequences encoding an adenovirus E1a protein of the invention may be delivered to a
subject for use as a cancer therapeutic. Optionally, such a molecule is formulated in a
lipid-based carrier and preferentially targets cancer cells. Such a formulation may be
combined with other cancer therapeutics (e.g., cisplatin, taxol, or the like). Still other
uses for the adenoviral sequences provided herein will be readily apparent to one of 
skill in the art. 

In addition, one of skill in the art will readily understand that the Ad 
sequences of the invention can be readily adapted for use for a variety of viral and 
non-viral vector systems for in vitro, ex vivo or in vivo delivery of therapeutic and 
immunogenic molecules, including any of those identified as being deliverable via the 
chimeric adenoviruses of the invention. For example, the simian Ad genome of the 
invention can be utilized in a variety of rAd and non-rAd vector systems. Such
vector systems may include, e.g., plasmids, lentiviruses, retroviruses, poxviruses,
vaccinia viruses, and adeno-associated viral systems, among others. Selection of these
vector systems is not a limitation of the present invention. 

The invention further provides molecules useful for production of the
simian and simian-derived proteins of the invention. Such molecules which carry
polynucleotides including the simian Ad DNA sequences of the invention can be in
the form of a vector.



B. Simian Adenoviral Proteins of the Invention 

The invention further provides gene products of the above 
adenoviruses, such as proteins, enzymes, and fragments thereof, which are encoded
by the adenoviral nucleic acids of the invention. The invention further encompasses
SA18 proteins, enzymes, and fragments thereof, having the amino acid sequences
encoded by these nucleic acid sequences which are generated by other methods. Such
proteins include those encoded by the open reading frames identified in the tables 
above, and fragments thereof. 

Thus, in one aspect, the invention provides unique simian adenoviral 
proteins which are substantially pure, i.e., are free of other viral and proteinaceous
proteins. Preferably, these proteins are at least 10% homogeneous, more preferably
60% homogeneous, and most preferably 95% homogeneous. 

In one embodiment, the invention provides unique simian-derived 
capsid proteins. As used herein, a simian-derived capsid protein includes any 
adenoviral capsid protein that contains a SA18 capsid protein or a fragment thereof,
as defined above, including, without limitation, chimeric capsid proteins, fusion
proteins, artificial capsid proteins, synthetic capsid proteins, and recombinant capsid
proteins, without limitation to means of generating these proteins.

Suitably, these simian-derived capsid proteins contain one or more 
SA18 regions or fragments thereof (e.g., a hexon, penton, fiber or fragment thereof) in
combination with capsid regions or fragments thereof of different adenoviral
serotypes, or modified simian capsid proteins or fragments, as described herein. A
"modification of a capsid protein associated with altered tropism" as used herein
includes an altered capsid protein, i.e., a penton, hexon or fiber protein region or
fragment thereof, such as the knob domain of the fiber region, or a polynucleotide
encoding same, such that specificity is altered. The simian-derived capsid may be
constructed with one or more of the simian Ad of the invention or another Ad
serotype which may be of human or non-human origin. Such Ad may be obtained
from a variety of sources including the ATCC, commercial and academic sources, or
the sequences of the Ad may be obtained from GenBank or other suitable sources. 

The amino acid sequences of the simian adenovirus penton proteins
of the invention are provided herein. The AdSA18 penton protein is provided in SEQ
ID NO: 13. Suitably, any of these penton proteins, or unique fragments thereof, may
be utilized for a variety of purposes. Examples of suitable fragments include the
penton having N-terminal and/or C-terminal truncations of about 50, 100, 150, or 200
amino acids, based upon the amino acid numbering provided above. Other suitable
fragments include shorter, internal, C-terminal, or N-terminal fragments. Further, the
penton protein may be modified for a variety of purposes known to those of skill in
the art.

The invention further provides the amino acid sequences of the hexon
protein of SA18, SEQ ID NO: 14. Suitably, this hexon protein, or unique fragments
thereof, may be utilized for a variety of purposes. Examples of suitable fragments
include the hexon having N-terminal and/or C-terminal truncations of about 50, 100,
150, 200, 300, 400, or 500 amino acids, based upon the amino acid numbering
provided above and in SEQ ID NO: 14. Other suitable fragments include shorter
internal, C-terminal, or N-terminal fragments. For example, one suitable fragment is the
loop region (domain) of the hexon protein, designated DE1 and FG1, or a
hypervariable region thereof. Such fragments include the regions spanning amino
acid residues about 125 to 443; about 138 to 441, or smaller fragments, such as those
spanning about residue 138 to residue 163; about 170 to about 176; about 195 to
about 203; about 233 to about 246; about 253 to about 264; about 287 to about 297;
about 404 to about 430; about 430 to 550; about 545 to 650; of the simian hexon
proteins, with reference to SEQ ID NO: 14. Other suitable fragments may be readily
identified by one of skill in the art. Further, the hexon protein may be modified for a
variety of purposes known to those of skill in the art. Because the hexon protein is
the determinant for serotype of an adenovirus, such artificial hexon proteins would
result in adenoviruses having artificial serotypes. Other artificial capsid proteins can
also be constructed using the chimp Ad penton sequences and/or fiber sequences of
the invention and/or fragments thereof.
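
The fragment definitions above (terminal truncations and residue ranges numbered against SEQ ID NO: 14) can be sketched in a few lines of Python. This is an illustration only: the `hexon` string is a hypothetical placeholder, not the SA18 hexon sequence, and the helper names `fragment` and `truncate` are assumptions introduced here. Coordinates are treated as 1-based and inclusive.

```python
def fragment(seq: str, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive) of seq."""
    if not (1 <= start <= end <= len(seq)):
        raise ValueError(f"range {start}-{end} is outside 1-{len(seq)}")
    return seq[start - 1:end]


def truncate(seq: str, n_term: int = 0, c_term: int = 0) -> str:
    """Remove n_term residues from the N-terminus and c_term from the C-terminus."""
    if n_term < 0 or c_term < 0 or n_term + c_term >= len(seq):
        raise ValueError("truncation would leave no residues")
    return seq[n_term:len(seq) - c_term]


# Hypothetical placeholder standing in for a hexon protein sequence.
hexon = "ACDEFGHIKLMNPQRSTVWY" * 50  # 1000 residues

# A few of the approximate residue ranges listed in the text.
loop_ranges = [(138, 163), (170, 176), (195, 203), (233, 246)]
loops = {r: fragment(hexon, *r) for r in loop_ranges}

# N- and C-terminal truncations of about 50 residues each.
core = truncate(hexon, n_term=50, c_term=50)
```

Because the ranges in the text are stated as "about", any real use would treat these boundaries as adjustable rather than exact.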

In one example, it may be desirable to generate an adenovirus having 
an altered hexon protein utilizing the sequences of a hexon protein of the invention. 
One suitable method for altering hexon proteins is described in US Patent 5,922,315,
which is incorporated by reference. In this method, at least one loop region of the
adenovirus hexon is changed with at least one loop region of another adenovirus
serotype. Thus, at least one loop region of such an altered adenovirus hexon protein
is a simian Ad hexon loop region of the invention. In one embodiment, a loop region
of the SA18 hexon protein is replaced by a loop region from another adenovirus
serotype. In another embodiment, the loop region of the SA18 hexon is used to
replace a loop region from another adenovirus serotype. Suitable adenovirus
serotypes may be readily selected from among human and non-human serotypes, as 
described herein. SA18 is selected for purposes of illustration only; the other simian 
Ad hexon proteins of the invention may be similarly altered, or used to alter another 
Ad hexon. The selection of a suitable serotype is not a limitation of the present 
invention. Still other uses for the hexon protein sequences of the invention will be
readily apparent to those of skill in the art. 

The invention further encompasses the fiber proteins of the simian 
adenoviruses of the invention. The fiber protein of AdSA18 has the amino acid
sequence of SEQ ID NO: 15. Suitably, this fiber protein, or unique fragments thereof,
may be utilized for a variety of purposes. One suitable fragment is the fiber knob,
which spans about amino acids 247 to 425 of SEQ ID NO: 15. Examples of other
suitable fragments include the fiber having N-terminal and/or C-terminal truncations
of about 50, 100, 150, or 200 amino acids, based upon the amino acid numbering
provided above and in SEQ ID NO: 15. Still other suitable fragments include internal
fragments. Further, the fiber protein may be modified using a variety of techniques
known to those of skill in the art. 

The invention further encompasses unique fragments of the proteins
of the invention which are at least 8 amino acids in length. However, fragments of
other desired lengths can be readily utilized. In addition, the invention encompasses
such modifications as may be introduced to enhance yield and/or expression of an
SA18 gene product, e.g., construction of a fusion molecule in which all or a fragment
of the SA18 gene product is fused (either directly or via a linker) with a fusion
partner to enhance yield and/or expression. Other suitable modifications include,
without limitation, truncation of a coding region (e.g., a protein or enzyme) to
eliminate a pre- or pro-protein
ordinarily cleaved and to provide the mature protein or enzyme and/or
mutation of a coding region to provide a secretable gene product. Still other 
modifications will be readily apparent to one of skill in the art. The invention further 
encompasses proteins having at least about 95% to 99% identity to the SA18 proteins
provided herein.

As described herein, vectors of the invention containing the
adenoviral capsid proteins of the invention are particularly well suited for use in
applications in which the neutralizing antibodies diminish the effectiveness of other
Ad serotype based vectors, as well as other viral vectors. The rAd vectors of the
invention are particularly advantageous in readministration for repeat gene therapy or
for boosting immune response (vaccine titers). Examples of such regimens are
provided herein.

Under certain circumstances, it may be desirable to use one or more of 
the SA18 gene products (e.g., a capsid protein or a fragment thereof) to generate an
antibody. The term "an antibody," as used herein, refers to an immunoglobulin
molecule which is able to specifically bind to an epitope. Thus, the antibodies of the
invention bind, preferably specifically and without cross-reactivity, to a SA18
epitope. The antibodies in the present invention exist in a variety of forms including,
for example, high affinity polyclonal antibodies, monoclonal antibodies, synthetic
antibodies, chimeric antibodies, recombinant antibodies and humanized antibodies.
Such antibodies originate from immunoglobulin classes IgG, IgM, IgA, IgD and IgE.

Such antibodies may be generated using any of a number of methods
known in the art. Suitable antibodies may be generated by well-known conventional
techniques, e.g., Kohler and Milstein and the many known modifications thereof.
Similarly desirable high titer antibodies are generated by applying known
recombinant techniques to the monoclonal or polyclonal antibodies developed to
these antigens [see, e.g., PCT Patent Application No. PCT/GB85/00392; British
Patent Application Publication No. GB2188638A; Amit et al., 1986 Science,
233:747-753; Queen et al., 1989 Proc. Nat'l. Acad. Sci. USA, 86:10029-10033; PCT
Patent Application No. PCT/WO9007861; and Riechmann et al., Nature,
332:323-327 (1988); Huse et al., 1988a Science, 246:1275-1281]. Alternatively, antibodies can
be produced by manipulating the complementarity determining regions of animal or
human antibodies to the antigen of this invention. See, e.g., E. Mark and Padlin,
"Humanization of Monoclonal Antibodies", Chapter 4, The Handbook of
Experimental Pharmacology, Vol. 113, The Pharmacology of Monoclonal Antibodies,
Springer-Verlag (June, 1994); Harlow et al., 1999, Using Antibodies: A Laboratory
Manual, Cold Spring Harbor Laboratory Press, NY; Harlow et al., 1989, Antibodies:
A Laboratory Manual, Cold Spring Harbor, New York; Houston et al., 1988, Proc.
Natl. Acad. Sci. USA 85:5879-5883; and Bird et al., 1988, Science 242:423-426.
Further provided by the present invention are anti-idiotype antibodies (Ab2) and
anti-anti-idiotype antibodies (Ab3). See, e.g., M. Wettendorff et al., "Modulation of
anti-tumor immunity by anti-idiotypic antibodies." In Idiotypic Network and Diseases, ed.
by J. Cerny and J. Hiernaux, 1990 J. Am. Soc. Microbiol., Washington DC: pp. 203-229.
These anti-idiotype and anti-anti-idiotype antibodies are produced using
techniques well known to those of skill in the art. These antibodies may be used for a
variety of purposes, including diagnostic and clinical methods and kits.

Under certain circumstances, it may be desirable to introduce a
detectable label or a tag onto a SA18 gene product, antibody or other construct of the
invention. As used herein, a detectable label is a molecule which is capable, alone or
upon interaction with another molecule, of providing a detectable signal. Most
desirably, the label is detectable visually, e.g., by fluorescence, for ready use in
immunohistochemical analyses or immunofluorescent microscopy. For example,
suitable labels include fluorescein isothiocyanate (FITC), phycoerythrin (PE),
allophycocyanin (APC), coriphosphine-O (CPO) or tandem dyes, PE-cyanin-5 (PC5),
and PE-Texas Red (ECD). All of these fluorescent dyes are commercially available,
and their uses known to the art. Other useful labels include a colloidal gold label.
Still other useful labels include radioactive compounds or elements. Additionally,
labels include a variety of enzyme systems that operate to reveal a colorimetric signal
in an assay, e.g., glucose oxidase (which uses glucose as a substrate) releases
peroxide as a product which in the presence of peroxidase and a hydrogen donor such
as tetramethyl benzidine (TMB) produces an oxidized TMB that is seen as a blue
color. Other examples include horseradish peroxidase (HRP) or alkaline phosphatase
(AP), and hexokinase in conjunction with glucose-6-phosphate dehydrogenase which
reacts with ATP, glucose, and NAD+ to yield, among other products, NADH that is
detected as increased absorbance at 340 nm wavelength.
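
For the NADH-based readout, an increase in absorbance at 340 nm can be converted to an NADH concentration with the Beer-Lambert law, A = e * c * l. The sketch below is illustrative and not part of the disclosed methods: it assumes the commonly cited molar extinction coefficient for NADH at 340 nm (6220 M^-1 cm^-1) and a 1 cm path length, and the function name and example reading are hypothetical.

```python
NADH_EPSILON_340 = 6220.0  # M^-1 cm^-1, commonly cited value for NADH at 340 nm


def nadh_concentration_molar(delta_a340: float, path_cm: float = 1.0) -> float:
    """Beer-Lambert law rearranged for concentration: c = A / (epsilon * l)."""
    if path_cm <= 0:
        raise ValueError("path length must be positive")
    return delta_a340 / (NADH_EPSILON_340 * path_cm)


# Hypothetical reading: absorbance rises by 0.311 in a 1 cm cuvette.
conc_uM = nadh_concentration_molar(0.311) * 1e6
print(f"{conc_uM:.1f} uM NADH")
```

With these assumed values the example reading corresponds to about 50 micromolar NADH.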

Other label systems that are utilized in the methods of this invention
are detectable by other means, e.g., colored latex microparticles [Bangs Laboratories,
Indiana] in which a dye is embedded are used in place of enzymes to form conjugates
with the target sequences and provide a visual signal indicative of the presence of the
resulting complex in applicable assays.

Methods for coupling or associating the label with a desired molecule
are similarly conventional and known to those of skill in the art. Known methods of
label attachment are described [see, for example, Handbook of Fluorescent Probes
and Research Chemicals, 6th Ed., R. P. M. Haugland, Molecular Probes, Inc.,
Eugene, OR, 1996; Pierce Catalog and Handbook, Life Science and Analytical
Research Products, Pierce Chemical Company, Rockford, IL, 1994/1995]. Thus,
selection of the label and coupling methods do not limit this invention.

The sequences, proteins, and fragments of the invention may be
produced by any suitable means, including recombinant production, chemical
synthesis, or other synthetic means. Suitable production techniques are well known
to those of skill in the art. See, e.g., Sambrook et al., Molecular Cloning: A
Laboratory Manual, Cold Spring Harbor Press (Cold Spring Harbor, NY).
Alternatively, peptides can also be synthesized by the well known solid phase peptide
synthesis methods (Merrifield, J. Am. Chem. Soc., 85:2149 (1962); Stewart and
Young, Solid Phase Peptide Synthesis (Freeman, San Francisco, 1969) pp. 27-62).
These and other suitable production methods are within the knowledge of those of
skill in the art and are not a limitation of the present invention.

In addition, one of skill in the art will readily understand that the Ad
sequences of the invention can be readily adapted for use for a variety of viral and
non-viral vector systems for in vitro, ex vivo or in vivo delivery of therapeutic and
immunogenic molecules. For example, in one embodiment, the simian Ad capsid
proteins and other simian adenovirus proteins described herein are used for non-viral,
protein-based delivery of genes, proteins, and other desirable diagnostic, therapeutic
and immunogenic molecules. In one such embodiment, a protein of the invention is
linked, directly or indirectly, to a molecule for targeting to cells with a receptor for
adenoviruses. Preferably, a capsid protein such as a hexon, penton, fiber or a fragment
thereof having a ligand for a cell surface receptor is selected for such targeting.
Suitable molecules for delivery are selected from among the therapeutic molecules
described herein and their gene products. A variety of linkers, including lipids,
polyLys, and the like, may be utilized. For example, the simian penton
protein may be readily utilized for such a purpose by production of a fusion protein
using the simian penton sequences in a manner analogous to that described in
Medina-Kauwe LK, et al., Gene Ther. 2001 May; 8(10):795-803 and Medina-Kauwe
LK, et al., Gene Ther. 2001 Dec; 8(23):1753-1761. Alternatively, the amino acid
sequences of simian Ad protein IX may be utilized for targeting vectors to a cell
surface receptor, as described in US Patent Appln 20010047081. Suitable ligands
include a CD40 antigen, an RGD-containing or polylysine-containing sequence, and
the like. Still other simian Ad proteins, including, e.g., the hexon protein and/or the
fiber protein, may be used for these and similar purposes.

Still other adenoviral proteins of the invention may be used alone,
or in combination with other adenoviral proteins, for a variety of purposes which will
be readily apparent to one of skill in the art. In addition, still other uses for the
adenoviral proteins of the invention will be readily apparent to one of skill in the art.

The compositions of this invention include vectors that deliver a
heterologous molecule to cells, either for therapeutic or vaccine purposes. Such
vectors, containing simian adenovirus DNA of SA18 and a minigene, can be
constructed using techniques such as those described herein for the chimeric
adenoviruses and such techniques as are known in the art. Alternatively, SA18 may
be a source for sequences of the chimeric adenoviruses described herein.

The following examples are illustrative, and are not intended to limit
the invention to those illustrated embodiments.

Example 1 - Construction of Pan5/C1 Chimeric Simian Viruses 

Five different adenoviruses initially isolated from the chimpanzee, AdC68
[US Patent 6,083,716], AdPan5, AdPan7, AdPan6 and AdC1 [US Patent 6,083,716],
have been sequenced. See, International Application No. PCT/US02/33645, filed
November 2002, for the sequences of Pan5 [SEQ ID NO: 1], Pan7 [SEQ ID NO: 3],
and Pan6 [SEQ ID NO: 2]. This application also provides sequences for SV1, SV25
and SV39 [SEQ ID NO: 4, 5, 6, respectively]. Sequence comparison of the capsid
protein sequences predicted that AdC1 clearly belonged to a different serological
subgroup than the other four chimpanzee derived adenoviruses.

However, attempts to cultivate AdC1 in HEK293 cells revealed it to be
fastidious in its growth characteristics (data not shown) and therefore possibly
unsuitable for use as a vector using the currently available E1 complementing cell
lines. However, because of the obvious sequence dissimilarity of AdC1 capsid
protein sequence from the other chimpanzee derived adenoviruses (as well as the
huAd5), chimeric adenovirus vectors were generated with the capsid characteristics of
AdC1. In view of the above-mentioned drawbacks associated with only making
hexon changes, more extensive replacements were made in the chimera described
herein, i.e., construction of chimeras where the replacement went beyond just the
hexon, to achieve two goals. The first was to determine whether making extended
replacements would allow for the rescue of viruses containing hexons of unrelated
serotypes that may not otherwise be amenable to rescue. The second goal was to test
whether the growth characteristics of adenovirus vectors such as AdPan5, that have
been found in our laboratory to be able to be grown to high titer for the purpose of
manufacture, would also be present in the chimeric virus, particularly when the hexon
(and other capsid proteins) are derived from a virus such as AdC1 that is difficult to
grow to a high yield in cell lines such as HEK293. An added bonus of extending the
replacement to include the fiber protein would be to further increase the antigenic
dissimilarity to beyond that afforded by a hexon change alone.

As an alternative to obtaining purified virus as source for adenoviral DNA to
sequence, we have resorted to cloning restriction fragments of viral DNA obtained
from infected cells ("Hirt prep"). The first adenovirus we have sequenced in this way
is Simian Adenovirus. EcoRI digestion of the Simian Adenovirus yielded 7
fragments. Shotgun cloning yielded clones of the 5 internal fragments, which were
sequenced. Completion of the sequencing was carried out by walking
towards each of the ends of the genome. The map of the genome is shown in Figure
1.
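
The fragment counts above follow from the geometry of a linear genome: n internal cut sites give n + 1 fragments, of which two carry the genome ends and the remaining n - 1 are flanked by sites on both sides. A minimal in silico digest illustrating this, using a hypothetical toy sequence rather than the actual viral genome (the function name and toy sequence are assumptions of this sketch):

```python
ECORI = "GAATTC"  # EcoRI recognition sequence; the enzyme cuts after the first G


def digest_linear(seq: str, site: str = ECORI, cut_offset: int = 1) -> list[str]:
    """Cut a linear sequence at every occurrence of site; return fragments in order."""
    seq = seq.upper()
    cuts = [i + cut_offset for i in range(len(seq) - len(site) + 1)
            if seq.startswith(site, i)]
    bounds = [0] + cuts + [len(seq)]
    return [seq[a:b] for a, b in zip(bounds, bounds[1:])]


# Hypothetical toy genome containing six EcoRI sites.
toy = "TTTT".join(["ACGTACGT"] + ["GAATTC" + "ACGT" * k for k in range(1, 7)])
fragments = digest_linear(toy)
internal = fragments[1:-1]  # fragments with an EcoRI end on both sides
print(len(fragments), len(internal))
```

Only the internal fragments have EcoRI-compatible ends on both sides, which is consistent with the terminal fragments being recovered separately by walking towards the genome ends.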

A. Construction of Two Pan5/C1 Chimeric Plasmids
The overall approach towards constructing chimeric viruses was to
first assemble the complete E1 deleted virus DNA into a single plasmid flanked by
recognition sites for the restriction enzyme SwaI, digest the plasmid DNA with SwaI
to release the virus DNA ends, and transfect the DNA into HEK293 cells to determine
whether viable chimeric adenovirus could be rescued. Two chimeric virus plasmids
were constructed, pSC1short and pSC1long.

The plasmid pSC1short harbors an E1 deleted Pan5 virus where an
internal 15226 bp segment (18332 - 33557) has been replaced by a functionally
analogous 14127 bp (18531 - 32657) from AdC1. This results in the replacement of
the Pan5 proteins hexon, endoprotease, DNA binding protein, 100 kD scaffolding
protein, 33 kD protein, protein VIII, and fiber, as well as the entire E3 region, with
the homologous segment from AdC1. The ClaI site at the left end of the AdC1
fragment is at the beginning of the hexon gene and the resulting protein is identical to
the C1 hexon. The EcoRI site which constitutes the right end of the AdC1 fragment
is within the E4 orf 7 part of the AdC1. The right end was ligated to a PCR generated
right end fragment from AdPan5 such that the regenerated orf 7 translation product is
chimeric between AdPan5 and AdC1.

The plasmid pSC1long harbors an E1 deleted Pan5 virus where an
internal 25603 bp segment (7955 - 33557) has been replaced by a functionally
analogous 24712 bp (7946 - 32657) from AdC1. This results in the replacement of
the AdPan5 pre-terminal protein, 52/55 kD protein, penton base protein, protein VII,
Mu, and protein VI with those from AdC1 in addition to those replaced in pSC1short.
The AscI site at the left end of the AdC1 fragment is at the beginning of the DNA
polymerase gene and results in a chimeric protein where the first 165 amino acids of
the AdPan5 DNA polymerase have been replaced by a 167 amino acid segment from
AdC1 DNA polymerase. In this N-terminal region, the homology between the
AdPan5 and AdC1 DNA polymerase proteins is 81% (72% identity).
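
The segment sizes quoted for pSC1short and pSC1long follow directly from the genome coordinates when both endpoints are counted. A minimal check, assuming 1-based inclusive coordinates (the function and dictionary names are illustrative only):

```python
def inclusive_length(start: int, end: int) -> int:
    """Length in bp of a segment whose first and last bases are both counted."""
    return end - start + 1


# (start, end) coordinates quoted in the text.
segments = {
    "pSC1short, Pan5 segment removed": (18332, 33557),
    "pSC1short, AdC1 segment inserted": (18531, 32657),
    "pSC1long, Pan5 segment removed": (7955, 33557),
    "pSC1long, AdC1 segment inserted": (7946, 32657),
}

lengths = {name: inclusive_length(*coords) for name, coords in segments.items()}
for name, bp in lengths.items():
    print(f"{name}: {bp} bp")

# Net change in genome size caused by each replacement.
net_short = (lengths["pSC1short, AdC1 segment inserted"]
             - lengths["pSC1short, Pan5 segment removed"])
net_long = (lengths["pSC1long, AdC1 segment inserted"]
            - lengths["pSC1long, Pan5 segment removed"])
```

Under this convention the computed lengths are 15226, 14127, 25603 and 24712 bp, matching the quoted sizes, and each replacement shortens the genome slightly (by 1099 bp and 891 bp, respectively).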

The plasmid pDVP5Mlu, which contains the left end of AdPan5, was
used as the starting plasmid for the chimeric vector construction.

The plasmid pDVP5Mlu was made as follows. A synthetic DNA
fragment harboring recognition sites for the restriction enzymes SmaI, MluI, EcoRI
and EcoRV respectively was ligated into pBR322 digested with EcoRI and NdeI so as
to retain the origin of replication and the beta-lactamase gene. The left end of Pan5
extending to the MluI site (15135 bp) was cloned into this plasmid between the SmaI
and MluI sites. The E1 gene was functionally deleted and replaced by a DNA
fragment harboring recognition sites for the extremely rare cutter restriction enzymes
(I-CeuI and PI-SceI). The 2904 base pairs of the right end of Pan-5 were PCR
amplified using the primers P5L [GCG CAC GCG TCT CTA TCG ATG AAT TCC
ATT GGT GAT GGA CAT GC, SEQ ID NO:7] and P5ITR [GCG CAT TTA AAT
CAT CAT CAA TAA TAT ACC TCA AAC, SEQ ID NO:8] using Tgo polymerase
(Roche). The PCR product was cut with MluI and SwaI, and cloned between MluI
and EcoRV of pDVP5Mlu to yield pPan5Mlu+RE. A 3193 bp fragment extending
from the MluI site (15135) to the ClaI (18328) site of Pan5 was then inserted between
the same sites of pPan5Mlu+RE to yield pPan5Cla+RE. The 3671 bp ClaI (18531) to
EcoRI (22202) fragment of the adenovirus C1 was cloned into pPan5Cla+RE between
ClaI (16111) and EcoRI (16116) to yield pPan5C1delRI. The 10452 bp internal
EcoRI fragment of the adenovirus C1 (22202 - 32653) was cloned into the EcoRI site
of pPan5C1delRI to yield pSC1short. To construct pSC1long, the AdC1 replacement
was further extended by replacing the AscI - ClaI 10379 bp fragment of AdPan5 in
pSC1short with the AdC1 AscI - ClaI 10591 bp fragment. Finally, a green fluorescent
protein (GFP) expression cassette was inserted into both pSC1short and pSC1long
between the I-CeuI and PI-SceI sites to yield pSC1shortGFP and pSC1longGFP
respectively.
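
The cloning steps above rely on restriction sites carried by the PCR primers. As an illustrative check that is not part of the original disclosure, the sketch below scans the two primer sequences given in the text for the standard recognition sequences of MluI, ClaI, EcoRI and SwaI. The function name and the 0-based position convention are assumptions of this sketch.

```python
# Standard recognition sequences (5'->3') for enzymes named in the text.
SITES = {
    "MluI": "ACGCGT",
    "ClaI": "ATCGAT",
    "EcoRI": "GAATTC",
    "SwaI": "ATTTAAAT",
}

# Primer sequences as given in the text (SEQ ID NO:7 and SEQ ID NO:8).
PRIMERS = {
    "P5L": "GCG CAC GCG TCT CTA TCG ATG AAT TCC ATT GGT GAT GGA CAT GC",
    "P5ITR": "GCG CAT TTA AAT CAT CAT CAA TAA TAT ACC TCA AAC",
}


def find_sites(seq: str) -> dict[str, list[int]]:
    """Map enzyme name -> 0-based start positions of its recognition site in seq."""
    seq = seq.replace(" ", "").upper()
    hits = {}
    for enzyme, site in SITES.items():
        positions = [i for i in range(len(seq) - len(site) + 1)
                     if seq.startswith(site, i)]
        if positions:
            hits[enzyme] = positions
    return hits


for name, primer in PRIMERS.items():
    print(name, find_sites(primer))
```

In this scan P5L carries MluI, ClaI and EcoRI sites and P5ITR carries a SwaI site immediately before the terminal repeat sequence, consistent with cutting the PCR product with MluI and SwaI and with the later use of ClaI and EcoRI sites in the intermediate plasmids.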

B. Rescue of chimeric Pan5/C1 recombinant vector adenoviruses

The plasmids pSC1shortGFP and pSC1longGFP were digested with
the restriction enzyme SwaI and transfected into HEK 293 cells. A typical
adenovirus-induced cytopathic effect was observed. The rescue of recombinant chimeric
adenovirus from the pSC1longGFP transfection was confirmed by collecting the
supernatant from the transfection and re-infecting fresh cells which were found to be
transduced as determined by GFP expression. Viral DNA prepared from the chimeric 
recombinant virus was digested with several restriction enzymes and found to have 
the expected pattern on electrophoresis (data not shown). 

The chimeric adenoviral construct with the shorter replacement,
pSC1short, encodes the C1 proteins hexon and fiber as well as the intervening open
reading frames for endoprotease, DNA binding protein, 100 kDa scaffolding protein,
33 kDa protein, and protein VIII. (The E3 region is also included within this region
but is unlikely to impact on the viability of the chimeric virus.) When the
replacement was extended to include the additional AdC1 proteins pTP (pre-terminal
protein), 52/55 kDa protein, penton base, protein VII, Mu, and protein VI, there was
no difficulty in rescuing viable chimeric virus. In this experiment, the chimeric
adenovirus construction strategy utilized the presence of AscI and ClaI restriction
enzyme sites present on the genes for DNA polymerase and hexon respectively on
both AdPan5 and AdC1.

The reasons for the relatively higher yield of the chimeric virus
compared to the wild-type AdC1 virus are not clear. In the growth of the SC1
chimeric virus in 293 cells, the adenoviral early region gene products of E1 and E4
are derived from Ad5 and AdPan5 respectively. The E1 and E4 gene products bind,
regulate and de-repress several cellular transcription complexes and coordinate their
activity towards viral multiplication. Thus it is possible that the E1 gene products
supplied in trans from the 293 cells and the E4 gene products from AdPan5 are more
optimal in the human 293 cell background than are the equivalent AdC1 gene
products. This may also apply to the activity of the major late promoter whose
activity is responsible for the transcription of the capsid protein genes. In the
chimeric virus, the major late promoter, and the protein IVa2 which transactivates it,
are derived from AdPan5. However, the E2 gene products required for adenoviral
DNA replication, pTP and single-stranded DNA-binding protein, are derived from
AdC1. The adenoviral DNA polymerase, which complexes with pTP, is chimeric in
AdSC1 but mostly AdPan5 derived.

Two plasmids have been constructed where the structural proteins derive 
from the chimpanzee adenovirus Pan 7 and Simian adenovirus 25 respectively, and 
the flanking sequences are derived from human Ad5 (the commonly used vector 
strain). The Adhu5-Pan7 chimeric adenovirus has been rescued, demonstrating that 
the chimeric virus construction method used to derive the Pan 5 - C1 chimeric virus 
is broadly applicable. Fig. 2 provides the map of the recombinant Adhu5-SV25 
chimeric virus. The portion of the genome replaced by DNA from Pan7 is indicated. 

Example 2 - C5C1 chimeric vector of invention as a delivery vehicle for 
immunogenic compositions 

A Pan 5 (Simian adenovirus 22, a subgroup E adenovirus) - C1 (Simian 
adenovirus 21, a subgroup B adenovirus) chimeric vector expressing the Ebola virus 
(Zaire) glycoprotein as a model antigen (C5C1-CMVGP) was constructed in order to 
test the efficacy of the vector as a vaccine; this vector was compared to 
the Adhu5-based vector (H5-CMVGP). Compared to H5-CMVGP, the C5C1-CMVGP 
vector yielded only a slightly decreased level of GP expression in 
transduced A549 cells. 

Thereafter, GP-specific T cell and B cell responses elicited in B10BR mice 
vaccinated with either H5-CMVGP or C5C1-CMVGP vectors were compared. The 
C5C1-CMVGP vector appeared to induce lower frequencies of gamma interferon 
producing CD8+ T cells, with slower kinetics, than the H5-CMVGP vector, as 
determined by intracellular cytokine staining using an H-2k restricted GP-specific 
peptide as stimulant. The total IgG response to GP, measured by ELISA, was 
equivalent in serum from mice vaccinated with the C5C1-CMVGP or the H5-CMVGP 
vectors. However, the C5C1-CMVGP vector induced a more potent Th1 
type response while the H5-CMVGP vector stimulated a more balanced Th1/Th2 type 
response. 

Example 3 - Generation of Simian Pan5/Human Ad40 Chimeric 
Adenovirus and Chimpanzee Pan5/Simian SA18 Chimeric Adenovirus 

The construction of plasmids designed to rescue chimeric adenoviruses in which 
the outside flanking regions are derived from the chimpanzee adenovirus AdPan5, 
and the internal region (containing the structural capsid protein genes) is derived 
from the human adenovirus Ad40 or the simian adenovirus SA18, is described 
below. 

As described for the Pan5-C1 chimeric adenovirus, the overall approach 
towards constructing chimeric viruses was to first assemble the complete E1 deleted 
virus DNA into a single plasmid flanked by recognition sites for the restriction 
enzyme SwaI, digest the plasmid DNA with SwaI to release the virus DNA ends, and 
transfect the DNA into HEK293 cells to determine whether viable chimeric 
adenovirus could be rescued. Two chimeric virus plasmids were constructed, 
pPan5-40 and pPan5-SA18, corresponding to the two chimeric adenoviruses referred 
to above. The plasmid pPan5-40 harbors an E1 deleted Pan5 virus where an internal 
22975 bp segment (10400 - 33374) has been replaced by a functionally analogous 
21603 bp segment (10043 - 21603) from Ad40. This results in the replacement of the 
AdPan5 52/55 kD protein, penton base protein, protein VII, Mu, protein VI, hexon, 
endoprotease, DNA binding protein, 100 kD scaffolding protein, 33 kD protein, 
protein VIII, and fiber, as well as the entire E3 region, with the homologous segment 
from Ad40. Similarly, the plasmid pPan5-SA18 harbors an E1 deleted Pan5 virus 
where an internal 22975 bp segment (10400 - 33374) has been replaced by a 
functionally analogous 19015 bp segment (10573 - 29587) from SA18. This results in 
the replacement of the AdPan5 52/55 kD protein, penton base protein, protein VII, Mu, 
protein VI, hexon, endoprotease, DNA binding protein, 100 kD scaffolding protein, 
33 kD protein, protein VIII, and fiber, as well as the entire E3 region, with the 
homologous segment from SA18. 
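
The segment sizes quoted above can be checked with a short calculation. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption that is consistent with the stated Pan5 and SA18 lengths); the function and variable names are illustrative only and are not part of the described method. The Ad40 segment is represented by its stated length rather than by coordinates.

```python
def segment_length(start, end):
    """Length in bp of a segment given 1-based, inclusive coordinates."""
    if end < start:
        raise ValueError("end must not precede start")
    return end - start + 1


# Coordinates and lengths quoted in the text.
PAN5_REMOVED = (10400, 33374)    # AdPan5 segment that is replaced
SA18_INSERTED = (10573, 29587)   # SA18 segment that replaces it
AD40_INSERTED_BP = 21603         # Ad40 segment length as stated

pan5_bp = segment_length(*PAN5_REMOVED)
sa18_bp = segment_length(*SA18_INSERTED)

# Net change in genome length relative to the E1 deleted Pan5 backbone.
net_sa18 = sa18_bp - pan5_bp
net_ad40 = AD40_INSERTED_BP - pan5_bp

print(pan5_bp, sa18_bp, net_sa18, net_ad40)
```

Under these assumptions both chimeric genomes come out shorter than the E1 deleted Pan5 backbone they are built on.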

A. Silent mutagenesis of XbaI site: 

The plasmid pDVP5Mlu, which contains the left end of AdPan5, 
was used as the starting plasmid for the chimeric vector construction. As a first 
step the XbaI site (3820) was mutagenized to destroy the recognition site without 
changing the coding sequence for polymerase. This was done by first sub-cloning 
the NdeI (812) to HindIII (4931) fragment into the plasmid pNEB193 (New 
England Biolabs) using the same restriction sites in pNEB193, to yield pNEBp5. 
A PCR reaction was performed on pNEBp5 using the primers P5XTOP 
(GATACCTAGGAACGAGGAGGATTTGATATTG, SEQ ID NO:9) and 
P5XBOT (ATGTACGCCTCCGCGCTCAC, SEQ ID NO:10) to yield a 591 bp 
product. The PCR product was cleaved with AvrII and BbvCI and ligated into 
pNEBp5 cut with XbaI and BbvCI to yield the desired mutation in the plasmid 
pNEBp5mut. The mutated NdeI-HindIII fragment from pNEBp5mut was ligated 
back into pDVP5Mlu to yield the desired mutated plasmid pDVP5Mlumut. 
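
The reason an AvrII-cut PCR product ligated into an XbaI-cut plasmid removes the XbaI site can be illustrated in a few lines. This is a hedged sketch, not the procedure used by the inventors: it relies only on the standard recognition sequences of XbaI (T^CTAGA) and AvrII (C^CTAGG), and the helper names are hypothetical. It does not check that the coding sequence for polymerase is preserved.

```python
# Recognition sites (both enzymes cut after the first base: N^CTAGN).
XBAI = "TCTAGA"
AVRII = "CCTAGG"

# Forward primer quoted in the text (SEQ ID NO:9).
P5XTOP = "GATACCTAGGAACGAGGAGGATTTGATATTG"


def cut_halves(site):
    """Return the (upstream, downstream) halves left by a cut after base 1."""
    return site[:1], site[1:]


xba_up, xba_down = cut_halves(XBAI)
avr_up, avr_down = cut_halves(AVRII)

# Identical 5'-CTAG overhangs make the two kinds of end ligatable.
overhangs_match = xba_down[:4] == avr_down[:4] == "CTAG"

# Hexamers that can form where an XbaI end meets an AvrII end.
hybrids = {xba_up + avr_down, avr_up + xba_down}

# Neither hybrid is a site for either enzyme, so the XbaI site is lost.
site_destroyed = all(h not in (XBAI, AVRII) for h in hybrids)

# The primer carries an AvrII site and no XbaI site.
primer_has_avrii = AVRII in P5XTOP
primer_has_xbai = XBAI in P5XTOP

print(sorted(hybrids), site_destroyed, primer_has_avrii, primer_has_xbai)
```
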
B. Insertion of the Pan5 right end comprising the right ITR and the complete E4 
region: 

The right end of Pan-5 was PCR amplified (P5RE2PCR) using the 
primers P5E4 [GATCGAATTCCCACTCTGTACCCCATCTCTG, SEQ ID NO:11] 
and P5ITR [GCG CAT TTA AAT CAT CAT CAA TAA TAT ACC TCA AAC, SEQ 
ID NO:8] using Tgo polymerase, cut with EcoRI and SwaI, and cloned between 
EcoRI and EcoRV of pDVP5Mlumut to yield pPan5Mlumut+RE. 
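
The cloning step above depends on the restriction sites carried in the 5' tails of the two primers. The sketch below scans the quoted primer sequences for the standard EcoRI, SwaI and EcoRV recognition sequences; the dictionary and function names are hypothetical and introduced only for illustration. SwaI and EcoRV both leave blunt ends, which is why a SwaI-cut end can be joined to an EcoRV-cut vector.

```python
# Standard recognition sequences for the enzymes named in the text.
SITES = {
    "EcoRI": "GAATTC",    # G^AATTC, cohesive end
    "SwaI": "ATTTAAAT",   # ATTT^AAAT, blunt end
    "EcoRV": "GATATC",    # GAT^ATC, blunt end
}

# Primers as quoted in the text.
PRIMERS = {
    "P5E4": "GATCGAATTCCCACTCTGTACCCCATCTCTG",                  # SEQ ID NO:11
    "P5ITR": "GCG CAT TTA AAT CAT CAT CAA TAA TAT ACC TCA AAC",  # SEQ ID NO:8
}


def find_sites(seq):
    """Map each enzyme whose site occurs in seq to its 0-based start index."""
    clean = seq.replace(" ", "").upper()
    return {
        name: clean.find(site)
        for name, site in SITES.items()
        if site in clean
    }


hits = {name: find_sites(seq) for name, seq in PRIMERS.items()}
print(hits)
```

Each primer carries exactly one of the sites used for cloning, placed near its 5' end, and neither contains an EcoRV site.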

C. Insertion of Ad40 or SA18 structural protein sequences: 

In order to construct p5-40, the Ad40 segment from XbaI (10038) to 
EcoRI (31642) was ligated into pDVP5Mlumut+RE2 between XbaI (8178) and 
EcoRI (12924) in two steps: first, the XbaI (30494) to EcoRI (31642) fragment was 
inserted, followed by the XbaI (10038) - XbaI (30494) fragment. 
To construct pPan5-SA18, the XbaI (10568) to EcoRI (29584) fragment from the 
simian adenovirus SA18 was inserted into pDVP5Mlumut+RE2 between XbaI (8178) 
and EcoRI (12924). 

The minigene encoding green fluorescent protein was inserted in 
place of the E1 deletion between the I-CeuI and PI-SceI sites in pPan5-40 and 
pPan5-SA18, respectively. These plasmids were purified, digested with SwaI and 
transfected into 293 cells. 

EXAMPLE 4 - Generation of Chimeric Pan6/Pan7 Vectors 

A panel of GFP expressing vectors was generated. This panel includes 
vectors that are chimeric between Pan 6 and Pan 7 where the hexon protein or the 
fiber protein or both hexon and fiber proteins of the Pan 7 vector have been replaced 
by those from Pan 6. Balb/C mice were immunized with either Pan 6 or Pan 7 and 
re-administration (by tail vein injection) was attempted 3 weeks later using each of 
the five GFP expressing vectors (C6-GFP, C7-GFP, and the three chimeric vectors). 
Three days later the level of liver transduction was estimated qualitatively by 
examining liver sections for the presence of GFP expression and quantitatively by 
estimating copies of GFP DNA by Taqman analysis. The data showed that antibodies 
to both hexon and fiber are important in preventing re-administration of adenoviral 
vectors. 

All publications cited in this specification are incorporated herein by 
reference. While the invention has been described with reference to a particularly 
preferred embodiment, it will be appreciated that modifications can be made without 
departing from the spirit of the invention. Such modifications are intended to fall 
within the scope of the appended claims. 

What is claimed is: 



1. A method of efficiently culturing a chimeric adenovirus in a selected host 
cell, said chimeric adenovirus being from a parental adenovirus strain incapable of 
efficient growth in said host cell, said method comprising the steps of: 
(a) generating a chimeric adenovirus comprising: 

(i) adenovirus sequences of the left terminal end and 
right terminal end of a first adenovirus which grows in a selected host cell type, said 
left end region comprising the 5' inverted terminal repeat (ITRs), and said right end 
region comprising the 3' inverted terminal repeat (ITRs); and 

(ii) the internal regions from a parental adenovirus 
which lacks its native 5' and 3' terminal regions, said internal regions comprising the 
late genes encoding the penton, hexon, and fiber; 

wherein the resulting chimeric adenovirus comprises, from 5' 
to 3', a left terminal region of the first adenovirus, the internal region of the parental 
adenovirus, and the right terminal region of the first adenovirus; and 

(b) culturing said chimeric adenovirus in the presence of functional 
adenovirus E1a, E1b, and E4 ORF6 genes from the first adenovirus or from an 
adenovirus serotype which transcomplements the first adenovirus, and further in the 
presence of necessary adenoviral structural genes from the left end of the adenovirus. 

2. The method according to claim 1, wherein the internal region of the 
parental adenovirus further comprises one or more functional adenovirus genes 
selected from the group consisting of Endoprotease open reading frame, DNA 
binding protein, 100 kDa scaffolding protein, 33 kDa protein, protein VIII, pTP, 
52/55 kDa protein, protein VII, Mu and protein VI. 

3. The method according to claim 1, wherein the polymerase, terminal 
protein and 52/55 kDa protein functions are provided in trans. 

4. The method according to claim 1, wherein the first adenovirus further 
comprises the polymerase, terminal protein and 52/55 kDa protein functions. 

5. The method according to claim 1, wherein the chimeric adenovirus 
comprises the adenoviral late genes 1, 2, 3, 4, and 5 of the parental adenovirus. 

6. The method according to claim 1, wherein the selected host cell stably 
contains one or more of the adenovirus E1a, E1b or E4 ORF6 functions. 

7. The method according to claim 1, wherein the chimeric adenovirus 
comprises one or more of the adenovirus E1a, E1b or E4 ORF6 of the first 
adenovirus. 

8. The method according to claim 1, wherein the first adenovirus is of 
human origin. 

9. The method according to claim 1, wherein the first adenovirus is of 
simian origin. 

10. The method according to claim 1, further comprising the step of isolating 
the chimeric adenovirus. 

11. A method for generating a chimeric adenovirus for growth in a selected 
host cell, said chimeric adenovirus being derived from a parental adenovirus strain 
incapable of efficient growth in said host cell, said method comprising the step of 
generating a chimeric adenovirus comprising: 

5' and 3' terminal regions of a first adenovirus which grows in a 
selected host cell type, said 5' terminal regions comprising the 5' inverted terminal 
repeat (ITRs) and necessary E1 gene functions, and said 3' terminal regions 
comprising inverted terminal repeat (ITRs) and necessary E4 gene functions; and 

internal regions from a parental adenovirus which lacks its native 5' 
and 3' terminal regions, said internal regions comprising the hexon, penton base and 
fiber; 

wherein the resulting chimeric adenovirus comprises, from 5' to 3', the 5' 
terminal region of the first adenovirus, the internal region of the parental adenovirus, 
and the 3' terminal regions of the first adenovirus. 

12. A chimeric adenovirus produced according to the method of claim 1. 

13. A chimeric adenovirus comprising a hexon protein of a selected 
adenovirus serotype which is incapable of efficient growth in a selected host cell, 
said modified adenovirus comprising: 

(a) adenovirus sequences of the left terminal end of a first 
adenovirus which grows in a selected host cell type, said left end region comprising 
the E1a, E1b and 5' inverted terminal repeat (ITRs); 

(b) adenovirus sequences of the internal region of the selected 
adenovirus serotype which is incapable of efficient growth in the selected host cell, 
said internal region comprising the genes encoding the penton, hexon and fiber of the 
selected adenovirus; 

(c) adenovirus sequences of the right terminal end of the first 
adenovirus, said right end region comprising the necessary E4 gene functions and the 
3' inverted terminal repeat (ITRs), 

wherein the resulting chimeric adenovirus comprises 
adenoviral structural and regulatory proteins necessary for infection and replication. 

14. The chimeric adenovirus according to claim 13, wherein the chimeric 
adenovirus further comprises the IIIa, 52/55 kDa and terminal protein (pTP) of the 
selected adenovirus serotype. 

15. The chimeric adenovirus according to claim 13, wherein the chimeric 
adenovirus comprises the polymerase of the first adenovirus. 

16. The chimeric adenovirus according to claim 13, wherein the chimeric 
adenovirus expresses a functional chimeric protein formed from the first adenovirus 
and the selected adenovirus, said chimeric protein is selected from the group 
consisting of polymerase, terminal protein, 52/55 kDa protein, and IIIa. 

17. The chimeric adenovirus according to claim 13, wherein the chimeric 
adenovirus comprises the terminal protein, 52/55 kDa, and/or IIIa of the selected 
adenovirus. 

18. A host cell comprising a chimeric adenovirus according to claim 12. 

19. The host cell according to claim 18, wherein said host cell is a human 
cell. 

20. An isolated simian adenovirus nucleic acid sequence selected from the 
group consisting of: 

(a) SA18 having the sequence of nucleic acids 1 to 31967 of SEQ ID 
NO:12 and 

(b) a nucleic acid sequence complementary to the sequence of any of 
(a) to (f). 

21. An isolated simian adenovirus serotype nucleic acid sequence selected 
from one or more of the group consisting of: 

(a) 5' inverted terminal repeat (ITR) sequences; 

(b) the adenovirus E1a region, or a fragment thereof selected 
from among the 13S, 12S and 9S regions; 

(c) the adenovirus E1b region, or a fragment thereof selected 
from among the group consisting of the small T, large T, IX, and IVa2 regions; 

(d) the E2b region; 

(e) the L1 region, or a fragment thereof selected from among 
the group consisting of the 28.1 kD protein, polymerase, agnoprotein, 52/55 kD 
protein, and IIIa protein; 

(f) the L2 region, or a fragment thereof selected from the 
group consisting of the penton, VII, VI, and Mu proteins; 

(g) the L3 region, or a fragment thereof selected from the 
group consisting of the VI, hexon, or endoprotease; 

(h) the 2a protein; 

(i) the L4 region, or a fragment thereof selected from the 
group consisting of the 100 kD protein, the 33 kD homolog, and VIII; 

(j) the E3 region, or a fragment thereof selected from the 
group consisting of E3 ORF1, E3 ORF2, E3 ORF3, E3 ORF4, E3 ORF5, E3 ORF6, 
E3 ORF7, E3 ORF8, and E3 ORF9; 

(k) the L5 region, or a fragment thereof selected from a fiber 
protein; 

(l) the E4 region, or a fragment thereof selected from the 
group consisting of E4 ORF7, E4 ORF6, E4 ORF4, E4 ORF3, E4 ORF2, and E4 
ORF1; and 

(m) the 3' ITR, of any of SA18 SEQ ID NO:12, or a sequence 
complementary to any of (a) to (m). 

22. A simian adenovirus protein encoded by the nucleic acid sequence 
according to claim 21. 

23. A composition comprising a simian adenovirus capsid protein according 
to claim 22 linked to a heterologous molecule for delivery to a selected host cell. 

24. A method for targeting a cell having an adenoviral receptor comprising 
delivering to a subject a composition according to claim 23. 

25. A nucleic acid molecule comprising a heterologous simian adenoviral 
sequence according to claim 21. 

26. The nucleic acid molecule according to claim 25, wherein said simian 
adenoviral sequence encodes an adenoviral gene product and is operatively linked to 
regulatory control sequences which direct expression of the adenoviral gene product 
in a host cell. 

27. The nucleic acid molecule according to claim 25, wherein said simian 
adenoviral sequence comprises the E1a region of SA18 SEQ ID NO:12. 

28. A pharmaceutical composition comprising the nucleic acid molecule 
according to claim 27 and a physiologically compatible carrier. 

29. A recombinant adenovirus having a capsid comprising a protein selected 
from the group consisting of: 

(a) a hexon protein of SA18, SEQ ID NO:13, or a unique fragment 
thereof; 

(b) a penton protein of SA18, SEQ ID NO:14, or a unique fragment 
thereof; 

(c) a fiber protein of SA18, SEQ ID NO:15, or a unique fragment 
thereof. 

30. The recombinant adenovirus according to claim 29, wherein the capsid is 
of an artificial serotype. 

31. The recombinant adenovirus according to claim 29, wherein said virus 
further comprises a heterologous gene operatively linked to sequences which direct 
expression of said gene in a host cell. 

32. The recombinant adenovirus according to claim 29, further comprising 
5' and 3' adenovirus cis-elements necessary for replication and encapsidation. 

33. The recombinant adenovirus according to claim 29, wherein said vector 
lacks all or a part of the E1 gene. 

34. A host cell comprising a heterologous nucleic acid molecule comprising 
the nucleic acid sequence according to claim 21. 

35. The host cell according to claim 34, wherein said host cell is stably 
transformed with the nucleic acid molecule. 

36. The host cell according to claim 34, wherein said host cell expresses one 
or more adenoviral gene products from said nucleic acid molecule, said adenoviral 
gene products selected from the group consisting of E1a, E1b, E2a, and E4 ORF6. 

37. The host cell according to claim 34, wherein said host cell is stably 
transformed with a nucleic acid molecule comprising the simian adenovirus inverted 
terminal repeats. 

38. A composition comprising a recombinant virus according to claim 29 in 
a pharmaceutically acceptable carrier. 

39. A method for delivering a heterologous gene to a mammalian cell 
comprising introducing into said cell an effective amount of the recombinant virus 
according to claim 29. 

40. A method for repeat administration of a heterologous gene to a mammal 
comprising the steps of: 

(a) introducing into said mammal a first vector which 
comprises the heterologous gene and 

(b) introducing into said mammal a second vector which 
comprises the heterologous gene; 

wherein at least the first virus or the second vector is a 
virus according to claim 29 and wherein the first and 
second recombinant vector are different. 

41. A method for producing a selected gene product comprising infecting a 
mammalian cell with the recombinant virus according to claim 29, culturing said cell 
under suitable conditions and recovering from said cell culture the expressed gene 
product. 

42. A method for eliciting an immune response in a mammalian host against 
an infective agent comprising administering to said host an effective amount of the 
recombinant adenovirus of claim 29, wherein said heterologous gene encodes an 
antigen of the infective agent. 

43. The method according to claim 42, comprising the step of priming the 
host with a DNA vaccine comprising the heterologous gene prior to administering 
the recombinant adenovirus. 

ABSTRACT OF THE DISCLOSURE 

A method for providing an adenovirus from a serotype which does not grow 
efficiently in a desired cell line with the ability to grow in that cell line is described. 
The method involves replacing the left and right termini of the adenovirus with the 
corresponding termini from an adenovirus which grows efficiently in the desired cell 
line. At a minimum, the left terminus spans the 5' inverted terminal repeat, and the 
right terminus spans the E4 region and the 3' inverted terminal repeat. The resulting 
chimeric adenovirus contains the internal regions spanning the genes encoding the 
penton, hexon and fiber from the serotype which does not grow efficiently in the 
desired cell. Also provided are vectors constructed from novel simian adenovirus 
sequences and proteins, host cells containing same, and uses thereof. 

catcatcaat aatatacctc aaacttttgg cgcgcgttaa tatgcaaatg aggtatttga 
atttggggat gcggggcggt gattggctgc gggagcggcg accgttaggg gcggggcggg 
tgacgttttg atgacgtggc cgtgaggcgg agccggtttg caagttctcg tgggaaaagt 
gacgtcaaac gaggtgtggt ttgaacacgg aaatactcaa ttttcccgcg ctctctgaca 
ggaaatgagg tgtttctggg cggatgcaag tgaaaacggg ccattttcgc gcgaaaactg 
aatgaggaag tgaaaatctg agtaattccg cgtttatggc agggaggagt atttgccgag 
SSccgagtag actttgaccg attacgtggg ggtttcgatt accgtatttt tcacdtaaat 
ttcogcgtac ggtgtcaaag tcc^gtgttt ttacgtaggt gtcagccgat cgccagggta 
tttaaacctg cgctctctag tcaagaggcc actcttgagt gccagcgagt agagftttct 
cctccgcgcc gcgagtcaga tctacacttt gaaagatgag gcacctgaga gacctgcccg 
gtaatgtttt cctggctact gggaacgaga ttctggaact ggtggtggac gccatgatgg 
gtgacgaccc tccggagccc cctadcccat ttgaagcgcc tccgotgtac gatttgtatg 
atctggaggt ggatgtgccc gagaacgacc ccaacgagga ggcggtgaat gatttgttta 
gcgatgccgc gctgctggct gccgagcagg ctaatacgga ctctggctca gacagcgatt 
cctctctcca taccccgaga cccggcagag gtgagaaaaa gatccccgag cttaaagggg 
aagagctcga cctgcgctgc tatgaggaat gcttgcctcc gagcgatgat gaggaggacg 
aggaggogat tcgagctgca gcgaacbagg gagtgaaaac agcgagcgag ggctttagcc 
tggactgtcc tactctgccc ggacacggct gtaagtcttg tgaatttcat cgcatgaata 
ctggagataa gaatgtgatg tgtgccctgt gctatatgag agcttacaac eattgtgttt 

gtcagtgaag aaatgcctgt tcgagaggtg ccacctgggg gtgatgagcg agggcgaagc 2880 

caaagtcaaa cactgcgcct ctaccgagac gggctgcttt gtactgatca agggcaatge 2940 

caaagtcaag cataatatga tctgtggggc ctcggatgag cgcggctacc agatgctgac 3000 

ctgcgccggt gggaacagcc atatgctagc caccgtgcat gtg^cctcgc acccccgcaa 3060 

gacatggccc gagttcgagc acaacgtcat gacccgctgc aatgtgcacc tggggtcccg 3120 

ccgaggcatg ttcatgccct accagtgcaa catgcaattt gtgaaggtgc tgctggagcc 3180 

cgatgccatg tccagagtga gcctgacggg ggtgtttgac atgaatgtgg agctgtggaa 3240 

aattctgaga tatgatgaat ccaagaccag gtgccgggcc tgcgaatgcg gaggcaagca 3300 

cgccaggctt cagcccgtgt gtgtggaggt gacggaggac ctgcgacccg at'catttggt 3360 

gttgtcctgc aacgggacgg agttcggctc cagcggggaa gaatctgact agagtgagta 3420 

gtgtttggga ctgggtggga gcctgcatga tgggcagaat gactaaaatc tgtgtttttc 3480 

tgcgcagcag catgagcgga agcgcctcct ttgagggagg ggtattcagc ccttatctga 3540 

cggggcgtct cccctcctgg gcgggagtgc gtcagaatgt gatgggatcc aCggtggacg 3600 

gccggcccgt gcagcccgcg aactcttcaa ccctgaccta cgcgaccctg agctcctcgt 3660 

ccgtggacgc agctgccgcc gca^fctgctg cttccgccgc cagcgccgtg cgcggaatgg 3720 

ccctgggcgc cggctactac agctctctgg tggccaactc gagttccacc aataaccccg 3780 

ccagcctgaa cgaggagaag ctgctgctgc tgatggccca gctcgaggcc ctgacpcagc 3840 

gcctgggcga gctgacccag caggtggctc agctgcaggc ggagacgcgg gccgcggttg 3900 

ccacggtgaa aaccaaataa aaaatgaatc aataaataaa cggagacggt tgttgatttt 3960 

aacacagagt cttgaatctt tatttgattt ttcgcgcgcg gtaggccctg gaccaccggt 4020 

cccgatcatt gagcacccgg tggatctttt ccaggacccg gtagaggtgg gcttggatgt 4080 

tgaggtacat gggcatgagc ccgtcccggg ggtggaggta gctccattgc agggcctcgt 4140 

gctcgggggt ggtgttgtaa atcacccagt catagcaggg gcgcagggcg tggtgctgca 4200 

cgatgtcctt gaggaggaga ctgatggcca cgggcagccc cttggtgtag gtgttgacga. 4260 

acctgttgag ctgggaggga tgcatgcggg gggagatgag atgcatcttg gcctggatct 4320 

tgagattggc gatgttcccg cccagatccc gccgggggtt catgttgtgc aggaccacca 4380 

gcacggtgta tccggtgcac ttggggaatt tgtcatgcaa cttggaaggg aaggcgtgaa 4440 

agaatttgga gacgcccttg tgaccgccca ggttttccat gcactcatcc atgatgatgg 4500 

cgatgggccc gtgggcggc^ gcttgggcaa agacgtttcg ggggtcggac acatcgtagt 4560 



tgtggtcctg ggtgagctcg tcataggcca ttttaatgaa tttggggcgg agggtgcccg 4620 

actgggggac gaaggtgccc tcgajtcccgg gggcgtagtt gccctcgcag atctgcatct 4680 

cccaggcctt gagctcggag 999gggatca tgtccacctg cggggcgatg ^aaaaaacgg 4740 

tttccggggc gggggagatg agctgggccg aaagcaggtt ccggagcagc .tgggacttgc 4800 

cgcagccggt ggggccgtag atgaccccga tgaccggctg caggtggtag ttgagggaga 4860 

gacagctgcc gtcctcgcgg ag^agggggg ccacctcgtt catcatctcg cgcacatgca 4920 

tgttctcgcg cacgagttcc gccaggaggc gctcgccccc aagcgagagg agctcttgca 4980 

gcgaggcgaa gtttttcagc ggcttgagcc cgtcggccat gggcattttg gagagggtct 5040 

gttgcaagag ttccagacgg tcccagagct cggtgatgtg ctctagggca tdccgatcca 5100 

gcagacctcc tcgtttcgcg 99t^tggggcg actgcgggag tagggcacca ggcgatgggc 5160 

gtccagcgag gccagggtcc gg^ccttcca 99ggcgcagg gtccgcgtca gcgtggtctc 5220 

cgtcacggtg aaggggtgcg cgccgggccg ggcgcttgcg agggtgcgct tcaggctcat 5280 

ccggctggbc gagaaccgct cccpgtcggc gccctgcgcg tcggccaggt agcaattgag 5340 

catgagttcg tagttgagcg cctcggccgc gtggcccttg gcgcggagct tacetttgga 5400 

^g^gtgtccg cagacgggac agaggaggga cttgagggcg tagagcttgg. gggcgaggaa 5460 

gacggactcg ggggcgtagg cgtccgcgcc gcagctggcg cagacggtct cgcactccac 

gagccaggtg aggtctggcc ggtcggggtc aaaaacgagg tttcctccgt gctttttgat 5580 

gcgtttctta cctctggtcb ccatgagctc gtgtccccgc tgggtgacaa agaggctgtc 

cgtgtccccg tagaccgact ttatgggccg gtcctcgagc ggggtgccgc ggtcctcgtc 5700 

gtagaggaac cccgcccact ccgagacgaa ggcccgggtc caggccagca cgaaggaggc 5760 

cacgtgggag gggtagcggt cgttgtccac cagcgggtcc accttctcca gggdatgcaa 5820 

gcacatgtcc ccctcgtcca ca,tccaggaa ggtgattggc ttgtaagtgt aggccacgtg 5880 

sccgggggtc ccggccgggg gggt^taaaa gggggcgggc ccctgctcgt cctcactgtc 5940 

ttccggatcg ctgtccagga gcgccagctg ttggggtagg tattccctct cgaaggcggg 

catgacctcg gcactcaggt tgtcagtttc tagaaacgag gaggatttga t'attsiacggt: 6060 

gccgttggag acgcctttca tgagcccctc gtccatctgg tcagaaaaga cga'tcttttt 

gttgtcgagc ttggtggcga aggagccgta gagggcgttg gagagcagct t^gcgatgga 6180 

gcgcatggtc tggttctttt ccttgtcggc gcgctccttg gcggcgatgt tgagctgcac 6240 

cagttggagg aatggctgtt 

gcgtccgcag tgctcgcaae gctgcacggg atgcacgtgc tgcacgagct gfacccgggt 8040 

tcctttgacg aggaatttca gtgggcagtg gagcgctggc ggctgcatct ggtgctgtac 8100 

tacgtcctgg ccatcggcgt ggccatcgtc tgcctcgatg gtggtcatgc tgacgaggcc 8160 

gcgcgggagg caggtccaga cctcggctcg gacgggtcgg agagcgagga cgagggcgcg 8220 

caggccgeag ctgtccaggg tcctgagacg ctgcggagtc aggtcagtgg gcagcggcgg 8280 

cgcgcggttg acttgcagga gcttttccag ggcgcgcggg aggtccagat ggtacttgat 8340 

ctccacggcg cogttggtgg cgaogtccac ggettgcagg gtcccgtgcc cctggggcgc 8400 

caccaccgug ccccgtttot tcttgggtgc tggcggcggc ggctccatgc ttagaagcgg 8460 

cggcgaggac gogcgccggg cggcaggggc ggctcggggc ccggaggcag gggcggcagg 8520 

Sgcacgtcgg cgccgcgcgc gggfcaggttc tggtactgcg cccggagaag actggcgtga 8580 

gcgacgacgc gacggttgac gtcctggatc tgacgcctct gggtgaaggc cacgggaccc 8640 

9tgagtttga acctgaaaga gagttcgaca gaatcaatct cggtatcgtt gacggcggcc 8700 

tgccgcagga tctcttgcac gtc^eccgag ttgtcctggt aggcgatctc ggtcatgaac 8760 

.:gctcgatct cctcctcctg aaggtctccg cgaccggcgc gctcgacggt gge'cgcgagg 8820 

tcgttggaga tgcggcccat gagctgcgag aaggcgttca tgccggcctc gttccagacg 8880 

cggctgtaga ccacggctcc gtcg^gtcg cgcgcgcgca tgaccacctg ggcgaggttg 8940 

agctcgacgt ggcgcgtgaa gaccgcgtag ttgcagaggc gctggtagag gtagttgagc 9000 

9tggtggcga tgtgctcggt gacgaagaag tacatgatcc agcggcggag cggcatctcg 9060 

ctgacgtcgc ccagggcttc caagcgctcc atggcctcgt agaagtccac ggcgaagttg 9120 

aaaaactggg agttgcgcgc cgagacggtc aactcctcct ccagaagacg ^atgagctcg 9180 

Scgatggtgg cgcgcacctc gcgotcgaag gccccggggg gctcctcttc ^tccatctcc 9240 

tcctcctctt ccatctcctc cactaacatc tcttctactt cctcctcagg aggi:ggcggc 9300 

99gggagggg ccctgcgtcg ccggcggcgc acgggcagac ggtcgatgaa gcgctcgatg 9360 

gtctccccgc gccggcgacg catggtctcg gtgacggcgc gcccgtcctc gcggggcegc- ' 9420 

agcgtgaaga cgccgccgcg catctccagg tggccgccgg gggggtctcc gttgggcagg 9480 

Sagagggcgc tgacgatgca tcttatcaat tggcccgtag g^actcogcg cakggacctg 9540 

agcgtctcga gatccacggg atccgaaaac cgctgaacga aggcttcgag ccagtcgcag 9600 

tcgcaaggta ggctgagccc ggtttcttgt tcttcgggta tttggtcggg aggagggcgg 9660 



gcgatgctgc tggtgatgaa gttgaagtag gcggtcctga gacggcggat ggtggcgagg 9720 

agcaccaggt ccttgggccc ggcttgctgg atgcgcagac ggtcggccat gecccaggcg 9780 

tggtcctgac acctggcgag gtccttgtag tagtcctgca tgagccgctc cacgggcacc 9840 
tcctcctcgcccgcgcggcc gtgcatgcgc gtgagcccga acccgcgctg cggctggacg 9900 
agcgccaggt cggcgacgac gcgctcggcg aggatggcct gctggatctg ggtgagggtg 9960 

gtctggaagt cgtcgaagtc gacgaagcgg tggtaggctc cggtgttgat ggtgtaggag 10020 

cagttggcca tgacggacca gttgacggtc tggtggccgg ggcgcacgag ctcgtggtac 10080 

ttgaggcgcg agtaggcgcg cgtgtcgaag atgtagtcgt tgcaggtgcg cacgaggtac 10140 

tggtatccga cgaggaagtg cggcggcggc tggcggtaga gcggccatcg ctcggtggcg 10200 

ggggcgccgg gcgcgaggtc ctcgagcatg aggcggtggt agccgtagat gtacctggac 10260 

atccaggtga tgccggcggc ggtggtggag gcgcgcggga actcgcggac gcggttccag 10320 

atgttgcgca gcggcaggaa gtagttcatg gtggccgcgg tctggcccgt gaggcgcgcg 10380 

cagtcgtgga tgctctagac atacgggcaa aaacgaaagc ggtcagcggc tcgS^etccgt 10440 

ggcctggagg ctaagcgaac gggttgggct gcgcgtgtac cccggttcga gtccctgctc 10500 

gaatcaggct ggagccgcag ctaacgtggt actggcactc ccgtctcgac ccaagcctgc 10560 

taacgaaacc tccaggatac ggaggcgggt cgttttggcc attttcgtca ggccggaaat 10620 

gaaactagta agcgcggaaa gcggccgtcc gcgatggctc gctgccgtag tctggagaaa 10680 

gaatcgccag ggttgcgttg cggtgtgccc cggttcgagc ctcagcgctc ggcgccggcc 10740 

ggattccgcg gctaacgtgg gcgtggctgc cccgtcgttt ccaagacccc ttagccagce 10800 

gacttcteca gttacggagc gagcccctct ttttcttgtg tttttgdcag atgcatcccg 10860 

tactgcggca gatgcgcccc caccct'ccac cacaaccgcc cctaccgcag cagcagcaac 10920 

agccggcgct tctgcccccg ccccagcagc agcagccagc cactaccgcg gcggccgccg 10980 

tgagcggagc cggcgttcag tatgacctgg ccttggaaga gggcgagggg ctggcgcggc 11040 

tgggggcgtc gtcgccggag cggcacccgc gcgtgcagat gaaaagggac gctcgcgagg 11100 

cctacgtgcc caagcagaac ctgttcagag acaggagcgg cgaggagccc gaggagatgc 11160 

gcgcctcccg cttccacgcg gg&c:gggagc tgcggcgcgg cctggaccga aagcgggtgc 11220 

tgagggacga ggatttcgag gcggacgagc tgacggggat cagccccgcg cgcgcgcacg 11280 

tggccgcggc caacctggtc acggcgtacg agcagaccgt gaaggaggag agcaacttcc 11340 

aaaaatcctt caacaaccac gtgcgcacgc tgatcgcgcg cgaggaggtg accctgggcc 11400 



tgatgcacct gtgggacctg ctggaggcca tcgtgcagaa ccccacgagc aagccgctga 11460 

cggcgcagct gtttctggtg gt.gcagcaca gtcgggacaa cgagacgttc agggaggcgc 11520 

tgctgaatat caccgagccc gagg[gccgct ggctcctgga cctggtgaac attctgcaga 11580 

gcatcgtggt gcaggagcgc .gssctgccgc tgtccgagaa gctgs[cggcc atcaacttct 11640 

cggtgctgag cctgggcaag tactacgcta ggaagatcta caagaccccg tacgtgccca 11700 

tagacaagga ggtgaagatc gacgggtttt acatgcgcat gaccctgaaa gtgctgaccc 11760 

tgagcgacga tctgggggtg taccgcaacg acaggatgca ccgcgcggcg agc§ccagcc 11820 

gccggcgcga gctgagcgac caggagctga tgcacagcct gcagcgggcc ctgaccgggg 11880 

ccgggaccga gggggagagc tact'ttgaca tgggcgcgga cctgcgctgg cagcctagcc 11940 

gccgggcckt ggaagctgcc ggcggttccc cctacgtgga ggaggtggac gatgaggagg 12000 

aggagggcga gtacctggaa gactgatggc gcgaccgtat ttttgctaga tgcagcaaca 12060 

gccaccgccg cctcctgatc ccgcgatgcg ggcggcgctg cagagccagc cgtccggcat 12120 

taactcctcg gacgattgga cccaggccat gcaacgcatc atggcgctga cgacccgcaa 12180 

tcccgaagcc tttagacagc agcctcaggc caaccgactc tcggccatcc tggaggccgt 12240 

ggtgccctcg cgctcgaacc ccacgcacga gaaggtgctg gccatcgtga acgcgctggt 12300 

ggagaacaag gccatccgcg gcgacgaggc cgggctggtg tacaacgcgc tgctggagcg 12360 

cgtggcccgc tacaacagca ccaacgcgca gacgaacctg gaccgcatgg tgaccgacgt 12420 

gcgcgaggcg gtgtcgcagc gcgagcggtt ccaccgcgag tcgaacctgg gctccatggt 12480 

ggcgctgaac gccttcctga gcacgcagcc cgccaacgtg ccccggggcc aggaggacta 12540 

caccaacttc atcagcgcgc tgcggctgat ggtggccgag gtgccccaga gcgaggtgta 12600 

ccagtcgggg ccggacbact tcttccagac cagtcgccag ggcttgcaga ccgtgaacct 12660 

gagccaggct ttcaagaact tgcagggact gtggggcgtg caggccccgg tcggggaccg 12720 
cgcgacggtg tcgagcctgc tgacgccgaa ctcgcgcctg ctgctgctgc tggtggcgcc 12780 
cttcacggac agcggcagcg tgagccgcga ctcgtacctg ggctacctgc ttaacctgta 12840 
ccgcgaggcc atcgggcagg cgcacgtgga cgagcagacc taccaggaga tcacccacgt 12900 
gagccgcgcg ctgggccagg aggacccggg caacctggag gccaccctga acttcctgct 12960 
gaccaaccgg tcgcagaaga tcccgcccca gtacgcgctg agcaccgagg aggagcgcat 13020 
cctgcgctac gtgcagcaga gcgtggggct gttcctgatg caggaggggg ccacgcccag 13080 
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cgccgcgctc gacatgaccg cgcgc^acat ggagcccagc atgtacgccc gc^ccgccc 13140 
gttcatcaat aagctgatgg actacttgca tcgggcggcc gccatgaact cggactactt 13200 
taccaacgcc atcttgaacc cgcactggct cccgccgccc gggttctaca cgggcgagta 13260 
cgacatgccc gaccccaacg acgggttcct gtgggacgac gtggacagca gcgtgttctc 13320 
gccgcgcccc accabcacca ccgtgtggaa gaaagagggc ggggaccggc ggccgtcctc 13380 
ggcgctgtcc ggtcgcgcgg gtgctgccgc ggcggtgccc gaggccgcca gccccttccc 13440 
gagcctgccc ttttcgctga acagcgtgcg cagcagcgag ctgggtcggc tgacgcggcc 13500 
gcgcctgctg ggcgaggagg agtacctgaa cgactccttg cttcggcccg agcgcgagaa 13560 
gaacttcccc aataacggga tagagagcct ggtggacaag atgagccgct ggaagacgta 13620 
cgcgcacgag cacagggacg agccccgagc tagcagcagc accggcgcca cfccgtagacg 13680 
ccagcggcac gacaggcagc ggggtctggt gtgggacgat gaggattccg ccgacgacag 13740 
cagcgtgttg gacttgggtg ggagtggtgg tggtaacccg ttcgctcacc tgcgcccccg 13800 
tatcgggcgc ctgatgtaag aatctgaaaa aataaaagac ggtactcacc aaggccatgg 13860 
cgaccagcgt gcgttcttct ctgttgtttg tagtagtatg atgaggcgcg tgtacccgga 13920 
gggtcctcct ccctcgtacg agagcgtgat gcagcaggcg gtggcggcgg cgatgcagcc 13980 
cccgctggag gcgccttacg tgcc6ccgcg gtacctggcg cctacggagg ggcggaacag 14040 
cattcgttac tcggagctgg cacccttgta cgataccacc cggttgtacc tggtggacaa 14100 
caagtcggcg gacatcgcct cgc^tgaacta ccagaacgac cacagcaact tcctgaccac 14160 
cgtggtgcag aacaacgatt tcacccccac ggaggccagc acccagacca tcaactttga 14220 
cgagcgctcg cggtggggcg gccagc|:gaa aaccatcatg cacaccaaca tigcccaacgt 14280 
gaacgagttc atgtacagca acaagttcaa ggcgcgggtg atggtctcgc gcaagacccc 14340 
caacggggtc acagtaacag atggtagtca ggacgagctg acctacgagt gggtggagtc 14400 
tgagctgccc gagggcaact tctcggtgac catgaccatc gatctgatga acaacgccat 14460 
catcgacaac tacttggcgg tggggcggca gaacggggtg ctggagagcg acatcggcgt 14520 
gaagttcgac acgcgcaact tccggctggg ctgggacccc gtgaccgagc tggtgatgcc 14580 
gggcgtgtac accaacgagg ccttccaccc cgacatcgtc ctgctgcccg gctgcggcgt 14640 
ggacttcacc gagagccgcc tcagcaacct gctgggcatc cgcaagcggc agcccttcca 14700 
ggagggcttc cagatcctgt acgaggacct ggaggggggc aacatccccg cgctgctgga 14760 
cgtggacgcc tacgagaaaa gcaaggagga tagcgccgcc gcggcgaccg cagccgtggc 14820 



caccgcctct accgaggtgc ggggcgataa ttttgctagc gccgcgacac tggcagcggc 14880 

cgaggcggct gaaaccgaaa gtaagatagt gatccagccg gtggagaagg aeagcaagga 14940 

gaggagctac aacgtgctcg cggacaagaa aaacaccgcc taccgcagct ggtaectggc 15000 

ctacaaotac ggcgaceccg aga*igggggt gcgctcctgg acgctgctca ccacctcgga 15060 

cgtcacctgc ggcgtggagc aagtotactg gtcgctgcce gacatgatgc aigacccggt 15120 

caccttccge tccacgcgtc aagttagcaa ctacccggtg gtgggcgccg agctcctgcc 15180 

cgtctaetce aagagcttct tcaacgagca ggccgtctao tcgcagcagc tgcgbgcctt 15240 

cacctcgctc acgcacgtct tcaaccgctt ecccgagaac cagatcctcg ctcgcccgcc 15300 

cgcgcccacc attaccaccg tcagtgaaaa cgttcctgct ctcacagatc acgggaccct 15360 

gccgctgcgc agcagtatcc ggggagtcca gcgcgtgacc gtcactgacg ccagacgccg 15420 

oacctgcccc tacgtetaca aggccccggg cgtagtcgcg ccgcgcgtcc tctcgagccg 15480 

caccttctaa aaaatgtcca ttctcatctc gcccagtaat aacaccggtt ggggcctgog 15540 

cgcgccoagc aagatgtacg gaggcgctcg ccaacgctcc acgcaacacc ccgtgcgcgt 15600 

gcgcgggcac ttccgcgctc cctggggcgc cctcaagggc irgcgtgcgct c^cgcaccac 15660 

cgtcgacgac gtgatcgacc aggtggtggo cgacgcgogc aactacacgc ccgcegccgc 15720 

gcccgtctcc aecgtggacg ocgtcatcga cagcgtggtg gccgacgcgc gccggtacge 15780 

ccgcgccaag agccggcggc ggcgcatcgc ccggcggcac cggagcaccc ccgccatgcg 15840 

cgcggcgcga gccttgctgc gcagggccag gcgcacggga cgcagggcca .tgctcagggc 15900 

Sgccagacgc gcggcctccg gcagcagcag cgcoggcagg acccgcagac gcgcggccac 15960 

ggcggcggcg gcggccatcg ccagcatgtc ccgcccgcgg cgcggcaacg tgtactgggt 16020 

gcgcgacgcc gccaccggtg tgcgcgtgcc cgtgcgcacc cgcccccctc gcacttgaag 16080 

atgctgactt cgcgatgttg atgtgtccca gcggcgagga ggatgtcoaa gcgcaaattc 16140 

aaggaagaga tgctccaggt catcgcgcct gagatctacg gcccggcggc ggtg^aggag 16200 

gaaagaaagc cccgcaaact gaagcgggtc aaaaaggaca aaaaggagga ggaagatgtg 16260 

gacggactgg tggagtttgt gcgogagttc gccccccggc ggcgcgtgca gtggcgcggg 16320 

cggaaagtga aaccggtgct gcgacccggc accacggtgg tcttcacgcc cggcgagcgt 16380 

tccggctccg cctccaagcg ctcbtacgac gaggtgtacg gggacgagga catcctcgag 16440 

caggcggccg aacgtctggg cgagtttgct tacggcaagc gcagccgccc cgcgcccttg 16500 
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aaagaggagg cggtgtccat ccogctggac cacaaeaaf.^ 

actggac cacggcaacc eeacgecgag cctgaagccg 16560 

aegaccctgc agcaggtgct gc6tggtgeg gcgccgcgcc ggggctPaa gcgcgagggc 16620 

ggcgaggatc tgtacccgac catgcagctg a.ggtgccca agcgccagaa gctggaggao 16680 

gtgctggagc acatgaaggt ggaCcccgag gtgcagcccg aggtcaaggt gctfgcccatc 16740 

aagcaggtgg ccccgggcct ggpgtgcag accgtggaca tcaagatccc cacggagccc 16800 

atggaaacgc agaccgagcc cgtgaagccc agcaccagca ccatggaggt gc.gacggat 16860 

ccctggatgc cggcaccggc ttccaccacc cgccgaagac gcaagtacgg cgcggccagc 16920 

ctgctgatgc ccaactacgc gctgcatcct tccatcatcc ccacgccggg ctaccgcggc 16980 

acgcgcttct accgcggcta caccagcagc cgccgccgca agaccaccac ccgccgccgc 17040 

cgtcgtcgca cccgccgcag oag.accgcg acttocgccg ccgccctggt gcggagagtg 17100 

taccgcagcg ggcgcgagcc tctgaccctg ccgcgcgcgc gctaccaccc gagcatcgcc 17160 

atttaactac cgcctcctac ttgcagatat ggccctcaca tgccgcc.cc gc^tccccat 17220 

tacgggctac cgaggaagaa agccgcgccg tagaaggcg acggggaacg ggctgcgtcg 17280 

ccatcaccac cggcggcggc gcgccatcag caagcggttg gggggaggct tcctgcccgc 17340 

gctgatgcoc atcatcgccg cggcgatcgg ggcgatcccc ggca.agctt ccgtggcggt 17400 

gcaggcctct cagcgccact gagacacagc ttggaaaatt tgtaataaaa aatggactga 17460 

cgctcctggt cctgtgatgt gtgtttttag atggaagaca Ccaacttttc gtccctggca 17520 

ccgcgacacg gcacgcggcc gtttatgggc acctggagcg acatcggcaa cagpcaactg 17580 

aacgggggcg ccttcaattg gag.agtctc tggagcgggo ttaagaattt .gggtccacg 17640 

ctcaaaacct atggcaacaa ggcgtggaac agcagcacag ggcaggcgct gagggaaaag 17700 

ctgaaagagc agaacttcca gcagaaggtg gtcgatggcc tggcctcggg catcaacggg 17760 

Stggtggacc Cggccaacca gg.cgtgcag aaacagatca acagccgcct ^gacgcggtc 17820 

ccgcccgcgg ggtccgtgga gatgccccag gtggaggagg agctgcctcc cctggacaag 17880 

c9oggcgaca agcgaccgcg tcccgacgcg gaggagacgc tgctgacgca cacggacgag 17940 

ccgcccccgt acgaggaggc ggtgaaactg ggtctgccca ccacgcggcc cgtggcgcct 18000 

ctggccaccg gggtgctgaa accoagcagc egcagcagcc agcccgcgac cctggacttg 18060 

cctccgcctg cctcccgccc ctccacagtg gctaagoccc tgccgccggt" g^ccgtcgcg 18120 

tcgcgcgccc cccgaggccg cccccaggcg aactggcaga gcactctgaa cagcatcgtg 18180 

Sgtctgggag tgcagagtgt g,agcgccgc cgctgctatt aaaagacact gtagcgctca 18240 






acttgcttgt ctgtgtgtat atgtatgtcc gecgaecaga aggaggagga agaggcgcgt 18300 
cgccgagttg caagatggcc aeeccatcga tgctgcccca gtgggcgtac atgcacatcg 18360 
ccggacagga cgcttcggag tacctgagtc cgggtctggt gcagttogcc cgcgccacag 18420 
acacctactt cagtctgggg aacaagttta ggaaccecac ggtggcgccc acgcacgatg 18480 
tgaccaccga ccgcagccag cggctgaegc tgcgcttcgt gcccgtggac cgcgaggaca 18540 

acacctactc gtacaaagtg cgctacacgc tggccgtggg egacaaccgc gtgctggaca 18600 

tggccagcac ctactttgac atccgcggcg tgctggatcg gggccctagc ttcaaaccct 18660 

actccggcac cgcttacaac agcctggctc ccaagggagc gcccaaoact Ugccagtgga 18720 

catataaagc tgatggtgat actggtacag aaaaaaccta tacatatgga aatgcgcctg 18780 

tgeaaggcat tagtattaca aaagatggta ttcaacttgg aactgacact gdtgatcagc 18840 

ccatttatgc agataaaact takcSaccag agcctcaagt gggtgatgct gaatggcatg 18900 

acatcactgg tactgatgaa aaatWtggag gcagagctct caagcctgac accaaaatga 18960 

agccctgcta tggttctttt gccaagccta ccaataaaga aggaggtcag gcaaatgtga 19020 

aaaccgaaac aggcggtacc aa^gaatatg acattgacat ggcattcttc gataatcgaa 19080 

gtgcagctgc ggctggcctg gccccagaaa ttgttttgta tactgagaat gtggatctgg 19140 

aaactccaga tactcatatt gtatacaagg cgggcacaga tgacagcagc tcttctatca 19200 

atttgggtca gcagtccatg cccaacagac ccaactacat tggctttaga gacaacttta 19260 

tcgggctcat gtactacaac agcaptggca acatgggcgt gctggctggt caggcctccc 19320 

agctgaatgc tgtggtggac ttgcaggaca gaaacactga actgtcetac cagctcttgc 19380 

ttgactctct gggcgacaga accaggtatt teagtatgtg gaatcaggcg gtggacagct 19440 

atgaccccga tgtgcgcatt a^tgaaaatc acggtgtgga ggatgaactc cctaacfcatt 19500 

gcttccccot ggatgctgtg ggta^aactg atacttacca gggaattaag gccaatggtg 19560 

ctgatcaaao cacctggacc aaagatgata ctgttaatga tgctaatgaa ttgggcaagg 19620 

gcaatccttt cgccatggag atcaacatcc aggccaacct gtggc^aac ttcctctacg 19680 

cgaacgtggc gctgtacctg cccgactcct acaagtacac gccggceaac atcacgctgc 19740 

cgaecaacac caacacctac gattacatga acggccgcgt ggtggcgccc tcgctggtgg 19800 

acgcctacat caacatcggg gcgcgctggt cgctggaccc catggacaac gtcaaccccc 19860 

tcaaccacca ccgcaacgcg ggectgcgct accgctccat gctcctgggc aacgggcgct 19920 
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acgtgccctt ccacatecag gtgccqcaaa agttcttcgc catcaagagc ctcctgctcc 19980 

tgcccgggtc ctacacctac gagtggaact tccgcaagga cgtcaacatg atcctgcaga 20040 

gctccctcgg caacgacctg cgcacggacg gggcctcoat cgccttcacc agcatcaacc 20100 

tctacgccac cttcttcccc atggcgcaca acaccgcctc cacgctcgag gccatgctgc 20160 

gcaacgacac caacgaccag tccttcaacg actacctctc ggcggccaac atgotctacc 20220 

ccatcccggc caacgccacc aacgtgccca tctccatccc ctcgcgcaac tgggccgcct 20280 

tccgcggatg gtccttcacg cgcctcaaga cccgcgagac gccctcgctc ggctccgggt 20340 

tcgaococta cttcgtctac tcgggctcca tcccctaect cgacggcacc ttotaoctca 20400 

accacacctt caagaaggtc tccatcacct tcgactcctc cgtcagctgg cccggcaacg 20460 

accgcctcct gacgcccaac gagttcgaaa tcaagcgcac cgtcgacgga gaggggtaca 20520 

acgtggccca gtgcaacatg aocaaggact ggttcctggt ccagatgctg gcccactaoa 20580 

acatcggcta ccagggcttc taigtgcccg agggctacaa ggaccgcatg t^ctccttct 20640 

tccgcaaett ccagcccatg agcogccagg togtggacga ggtcaactac aaggactacc 20700 

aggccgtcac cctggcctac cagcacaaca actcgggctt cgtcggctac ctegcgccca 20760 

ccatgcgcca gggacagccc tacbccgcca actaccccta cccgcteato ggcaagagcg 20820 

ccgtcgccag cgtcacccag aaaaagttcc tctgcgaccg ggtcatgtgg cgcatcccct 20880 

tctccagcaa cttcatgtcc atgggcgoge tcaccgaoct cggccagaac atgctctacg 20940 

ccaaetccgc ccacgcgcta gacacgaatt tcgaagtcga ccccatggat gagtccaccc 21000 

etctctatgt tgtcttcgaa gtcttcgacg tcgtccgagt gcaccagccc caccgcggcg 21060 

tcatcgaggc cgtctacctg cgcacgccct tctcggccgg caacgccacc acctaagcce 21120 

cgctcttgct tcttgcaaga tgacggcctg tgcgggctcc ggcgagcagg agctcagggc 21180 

catcctccgc gacctgggct gdgggccctg ctccctgggc accttcgaca agcgcttccc 21240 

gggattcatg gocccgcaca agctggcctg cgccaccgcc aacacggccg gcc^cgagac 21300 

cgggggcgag cactggctgg ccttcgcctg gaacccgcgc tcccacacct gctacctctt 21360 

cgaccccttc gggttctcgg acgagcgcct caagcagatc taccagttcg agtacgaggg 21420 

cctgctgcgc cgcagcgccc tggccaccga ggaccgctgc gtcaccctgg aaaagtccac 21480 

ccagaccgtg cagggtccgc gctcggccgc ctgcgggctc ttctgctgca tgttcetgca 21540 

cgccttcgtg cactggcccg accgccccat ggacaagaac cccaccatga acttgetgac 21600 

gggggtgccc aacggcatgc tccagtcgcc ccaggtggaa cccaccctgc gccgcaacca 21660 
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ggaggcgotc taccgcttcc tcaacgccca ctccgcctac tttcgctccc accgi:gcgc9 21720 
catcgagaag gccaccgcct tcgaccgcat gaatcaagac atgtaaaccg tgtgtgtatg 21780 
tgaatgottt attcataata aacagcacat gtttatgcca ccttttctga g^ctctgact 21840 
ttatttagaa atcgaagggg ttctgccggc tctcggegtg ccccgcgggc agggatacgt 21900 
tgcggaactg gtacttgggc agccacttga actcggggat cagcagcttc ggdaogggga 21960 
Sgtcggggaa cgagtcgctc cacagpttgc gcgtgagttg cagggcgccc agcaggtcgg 22020 
gcgcggagat cttgaaatcg caghtgggac ccgcgttctg cgcgcgggag ttgcggtaca 22080 
cggggttgca gcactggaac accatdaggg ccgggtgctt cacgctcgcc agcaccgtcg 22140 
cgtcggtgat gccctccacg tccagatcct cggcgttggc catcccgaag ggggtcatct 22200 
tgcaggtctg ccgccccatg ctgggcacgc agccgggctt gtggttgcaa tcgcigtgca 22260 
9399gatcag catcatctgg gectgitogg agctcatgcc cgggtacatg gccttcatga 22320 
aagcctccag ccggcggaag gcctgctgcg ccttgccgcc ctcggtgaag aagaccccgc 22380 
aggacttgct agagaactgg ttggtggcgc agccggcgtc gtgcacgcag cagcgcgcgt 22440 
cgttgttggc c^gctgcacc acgctgcgcc cccagcggtt ctgggtgatc ttggcccggt 22500 
cggggttcto cttcagcgcg cgctgcccgt tctcgctcgc cacatccatc tcgatcgtgt 22560 
gctccttctg gatcatcacg gtccc^tgca ggcaCcgcag cttgccctcg gcptcggtgc 22620 
acccgtgcag ccacagcgcg cagccggtgc actcccagtt cttgtgggcg atctgggagt 22680 
gcgagtgcac gaagccctgc aggaagcggc ccatcatcgt ggtcagggtc ttgttgctgg 22740 
tgaaggtcag cgggatgcog cggtgotect cgttcacata caggtggcag atgcggcggt 22800 
acacctcgcc ctgctcgggc atcagctgga aggcggactt caggtcgctc tccaegcggt 22860 
accggtccat cagcagcgtc atgacttcca tgcccttctc ccaggccgag acgatcggca 22920 
ggctcagggg gttcttcacc gccgttgtca tcttagtogc cgcogctgag gtc^gggggt 22980 
cgttctcgtc cagggtctoa aacactcgct tgccgtcctt ptcggtgatg cgcacggggg 23040 
gaaagotgaa gcccacggcc gccagctcet cctcggcctg cctttcgtcc tcgctgtcct 23100 
ggctgatgte ttgcaaaggc acatgcttgg tcttgcgggg tttctttttg ggcg^rcagag 23160 
9cggcggcgg agacgtgctg ggcgagcgcg agttctcgct caccacgact attt;cttct:t 23220 
cttggccgcc gtccgagacc acgcggcggt aggcatgcct cttctggggc agaggcggag 23280 
gcgacgggct ctcgcggttc ggcgggcggc tggcagagcc ccttccgcgt tcgggggtgc 23340 
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gctcctggcg gcgctgctct gactgacttc ctccgcggcc ggccattgtg ttctcctagg 23400 
gagcaacaag catggagact cagecatcgt cgccaacatc gccatctgcc cccgccgccg 23460 
ccgacgagaa ccagcagcag aatgaaagct taaccgcccc gccgcccagc cccacctccg 23520 
aogccgccgc ggccccagac atgcaagaga tggaggaatc catcgagatt gacctgggct 23580 
acgtgacgcc cgcggagcac gaggaggagc tggcagcgcg cttttcagcc ccggaagaga 23640 
accaccaaga gcagccagag caggaagcag agagcgagoa gcagcaggct gggctcgagc 23700 
atggcgacta cctgagcggg gcagaggacg tgctcatcaa gcatctggcc cgccaatgca 23760 
tcatcgtcaa ggacgcgctg ctcgaocgcg ccgaggtgcc cctcagcgtg gc^agctca 23820 
gecgcgccta cgagcgcaac ctcttctcgc cgcgcgtgcc ccccaagegc cagcccaacg 23880 
gcaccegcga gcccaacccg cgcctcaact tctacccggt cttcgcggtg cccgkggccc 23940 
tggccaccta ccacctcttt ttcaagaaco aaaggatccc cgtctcctgc cgcgccaacc 24000 
gcacccgcgc cgacgccctg ctcaacctgg gtcceggcgc ccgcctacct gatatcgcct 24060 
ccttggaaga ggttcccaag atcttcgagg gtctgggcag cgacgagact cgggccgcga 24120 
acgctctgca aggaagcgga gaggagcatg agcaocacag cgccctggtg gagttggaag 24180 
gcgacaacgc gcgcctggcg gtgctcaagc gcacggtega gctgacccac ttcgcctacc 24240 
cggcgctcaa cctgcccccc aaggtcatga gcgcogtcat ggaccaggtg ctcatcaagc 24300 
gcgcctcgcc cetctcggat gaggacatgc aggaccccga gagctcggac gagggcaagc 24360 
ccgtggtcag cgacgageag ctggcgcget ggctgggagc gagtagcacc ccccagagct 24420 
tggaagagcg gcgcaagctc atgatggccg tggtcctggt gaccgtggag ctg^agtgtc 24480 
tgcgccgctt cttcgcogao gcagagaccc tgcgeaaggt cgaggagaac ctgqactacc 24540 
tcttcaggca cgggtttgtg cgccaggcct gcaagatctc caacgtggag ctgaccaacc 24600 
tggtctccta catgggcatc ctgcacgaga accgcctggg gcagaacgtg ctgcacacca 24660 
ccctgcgcgg ggaggcccgc cgcgactaca tccgcgactg cgtctacctg tacbtctgcc 24720 
acacctggca gacgggcatg ggcgtgtggc agcagtgcct ggaggagcag aacctgaaag 24780 
agccctgcaa gctcctgcag aagaacotga aggccctgtg gaccgggttc gacgagcgca 24840 
ccaccgcctc ggacctggcc gacctcatct tccccgagcg cctgcggctg apgctgcgca 24900 
acggaccgcc cgactttatg agtcaaagca tgttgcaaaa etttcgctct tccatcctcg 24960 
aacgctccgg gatcctgccc gccacc.gct ccgcgetgcc ctcggacttc gtgccgctga 25020 
ccttccgcga gtgccccccg ccgctctgga gccactgcta cctgctgcgc ctggccaact 25080 
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acctggccta ccactcgaac gtgatcgagg acgtcagogg cgagggtetg ctcgagtgcc 251.0 
actgccgctg caacctctgc acgccgcacc gctccctggc ctgcaacccc cagctgctga 25200 
gcgagaccca gatcatcggc a.cttcgagt tgcaaggccc cggogagggc aaggggggtc 25260 
tgaaactcac cccggggctg tggacctcgg cctacttgcg caagttcgtg cccgaggaot 25320 
accatccctt cgagatcagg ttctacgagg accaatccca gccgcccaag gccgaactgt 25380 
cggcctgcgt catcacccag ggggccatcc tggcccaatt gcaagccatc cag^aatccc 25440 
gccaagaatt tctgctgaaa aagggccacg gggtctacct ggacccceag accggagagg 25500 
agcccaaccc cagcttcccc caggatgccc cgaggaagca gcaagaagct gaaagtggag 25560 
ctgccgccgc cggaggattt ggaggaagac tgggagagca gtcaggcaga ggaggaggag 25620 
atggaagact gggacagcac tcaggcagag gaggacagcc tgcaagacag tctggaagac 25680 
gaggtggagg aggaggcaga ggaagaagca gccgccgeca gaccgtcgtc ctcggcggag 25740 
aaagcaagca gcacggatac catctccgct ccgggtcggg gtcgcggcga ccgggcccac 25800 
agtaggtggg acgagaccgg gcgcttcccg aaccccacca cccagaccgg taagaaggag 25860 
cggcagggat acaagtcctg gcgggggcac aaaaacgcca tcgtctcctg ctt^caagcc 25920 
tgcgggggca acatctcctt cacc.gccgc tacctgctct tccaccgcgg •ggtgaactto 25980 
ccccgcaaca tcttgcatta ctaccgtcac ctccacagcc cctac.actg .ttccaagaa. 26040 
gaggcagaaa cccagcagca gcagaaaacc agcggcagca gcagctagaa aatccacagc 26100 
99cggcaggt ggactgagga tcgcagcgaa cgagccggcg cagacccggg agctgaggaa 26160 
ccggatcttt cccaccccct atg^catctt ccagcagagt cgggggcagg agcaggaact 26220 
gaaagccaag aaccgttctc tgcgctcgct caccogcagt tgtctgtatc acaagagcga 26280 
agaccaactt cagcgcactc tcgaggacgo cgaggctctc Ctcaaoaagt actgcgcgct 26340 
cactcctaaa gagcagcccg cgcccgccca cacacggaaa aaggcgggaa ttacgtcacc 26400 
acctgcgocc ttcgcccgac catcatoatg agcaaagaga ttcccacgcc ttacatgtgg 26460 
agctaccagc cccagat:ggg cctggbogcc ggcgccgccc aggaetacto cacccgcatg 26520 
aactggctca gcgccgggcc cgogatgatc tcacgggtga atgacatccg cgcccgccga 26580 
aaccagatac tcctagaaca gtcagcgatc accgccacgc cccgccatca cctt.atccg 26640 
cgtaattggc ccgccgccct ggtgtaccag gaaattcccc agcccacgac cgtactactt 26700 
ccgcgagaog cccaggccga agtccagctg actaactcag gtgtccagct ggccggcggc 26760 






gccgccctgt gtcgtcaccg ccccgctcag ggtataaagc ggetggtgat ccgaggcaga 26820 

ggcacacagc tcaacgacga ggtggtgagc tcttcgctgg gtctgcgacc tgaeggagtc 26880 

ttccaactcg ccggatcggg gagktcttcc ttcacgcctc gtcaggccgt cctgactttg 26940 

gagagttcgc cetcgcagcc ccgctcgggt ggcatcggca ctctccagtt cgtggaggag 27000 

ttcactccct cggtctactt caaccccttc tccggotccc ccggccacta cccggacgag 27060 

ttcatcccga aottcgacgc catcagcgag tcggtggacg gctacgattg aatgtcccat 27120 

ggtggcgcag ctgacctagc tqggcttcga cacctggacc actgccgccg cttccgctgc 27180 

ttogctcggg atctcgccga gtttgcctae tttgagctgo ccgaggagca ccitcagggc 27240 

ccggcccacg gagtgcggat catcgtcgaa gggggcctcg actcccacct gcttcggatc 27300 

ttcagccagc gaccgatcct ggtcgagcgc gagcaaggac agacccttct gaccctgtac 27360 

tgcatctgca accaccccgg cctgcatgaa agtctttgtt gtctgetgtg tactgagtat 27420 

aataaaagct gagatcagcg actactccgg actcgattgt ggtgttcctg ctateaaccg 27480 

gtccctgtto ttcaccggga acgagaccga gctecagett cagtgtaagc cccacaagaa 27540 

gtacotcacc tggctgttcc agggctcccc gatcgccgtt gtcaacoact gcgacaacga 27600 

cggagtcctg ctgagcggcc ccgccaacct tactttttcc acccgcagaa gcaagctcca 27660 

gctcttccaa cccttcctcc ccggg^ccta tcagtgcgtc tcgggaccct gtfcatcacac 27720 

cttccacctg atcccgaata ccacagcgcc gctccccgct actaacaacc aaactaccea 27780 

ccatcgccao cgtcgcgacc tttctgaatc taacactacc acccacaccg gaggtgagct 27840 

ccgaggtcga ccaacctctg ggatttaota cggcccctgg gaggtggtgg ggttaatagc 27900 

gctaggecta gttgtgggtg ggcttttggc tctctgctac ctatacctcc cttgctgttc 27960 

gtacttagtg gtgctgtgtt gctggtttaa gaaatgggga agatcaccct agtgagctgc 28020 

S9tgcgctgg tggcggtggc ggfcgttttcg attgtgggac tgggcggcgc ggctgtagtg 28080 

aaggagaagg ccgatccctg cttgcatttc aatcccgaca attgccagct gagttttcag 28140 

cccgatggca atcggtgcgc ggtgctgatc aagtgcggat gggaatgcga gaacgtgaga 28200 

atcgagtaca ataacaagao tcggaacaat actctcgcgt ccgtgtggca gcbcggggac 28260 

cccgagtggt acaccgtctc tgtccccggt gctgacggct ccccgcgcac cgtgaacaat 28320 

actttcattt ttgcgcacat gtgcgacacg gtcatgtgga tgagcaagca gtacgatatg 28380 

tggcccccca cgaaggagaa catcgtggtc ttctccatcg cttacagcgc gcgcacggcg 28440 

ctaatcaccg ctatcgtgtg cctgagcatt cacatgctca tcgctatccg ccccagaaat 28500 
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aatgccgaaa aagagaaaca gccataacac gttttttcac acaccttttt cagaccatgg 28560 
cctctgttaa atttttgctt ttatttgcca gtctcattac tgttataagt aatgagaaac 28620 
tcactattta cattggcact aaccacactt tagacggaat tccaaaatcc tcatggtatt 28680 
gc.tattttga tcaagatcca gacttaacta tagaactgtg tggtaacaag ggaaaaaata 28740 
caagcattca tttaattaac tttaattgcg gagacaattt gaaattaatt aatatcacta 28800 
aagagtatgg aggtatgtat tactatgttg cagaaaataa caacatgcag ttttatgaag 28860 
ttactgtaac taatcccacc acacctagaa caacaacaac caccaccaca aaaactacac 28920 
ctgttaccac tatgcagctc actaccaata acatttttgc catgcgtcaa atggtcaaca 28980 
atagcactca acccacccca cccagtgagg aaattcccaa atcca^gatt ggcattattg 29040 
ttgctgtagb ggtgtgcatg ttgatcatcg ccttgtgcat ggtgtacfcat gccttctgct 29100 
acagaaagca cagactgaac gacaagctgg aacacttact aagtgttgaa ttttaatttt 29160 
ttagaaccat gaagatccta ggcctbttaa ttttttctat cattacctct gctctatgca 29220 
attctgacaa tgaggacgtt actgtcgttg tcggaaccaa ttatacactg aaaggtccag 29280 
cgaagggtat gctttcgtgg tattgctggt ttggaactga cgagcaacag acagagctct 29340 
gcaatgctca aaaaggcaaa acctcaaatt ctaaaatctc taattatcaa tgcaatggca 29400 
ctgacttagt actgctcaat gtcacgaaag catatgctgg cagctacacc tgccctggag 29460 
atgatactga gaacatgatt titttacaaag tggaagtggt tgatcccact actccacctc 29520 
cacccaccac aactactcac accacacaca cagaacaaac cacagcagag gaggcagcaa 29580 
agttagcctt gcaggtccaa gacagttcat ttgttggcat tacccctaca cctgatcagc 29640 
ggtgtccggg gctgctcgtc agcggcattg tcggtgtgct ttcgggatta gcagtcataa 29700 
tcatctgcat gttcattttt gcttgctgct atagaaggct ttaccgacaa aaatcagacc 29760 
cactgctgaa cctctatgtt taattttttc cagagccatg aaggcagtta gcactctagt 29820 
tttttgttct ttgattggca ctgtttttag tgttagcttt ttgaaacaaa tcaatgttac 29880 
tgagggggaa aatgtgacac tggtaggcgt agagggtgct caaaatacca cctggacaaa 29940 
attccatcta gatgggtgga aagaaatttg cacctggaat gtcagtactt atacabgtga 30000 
aggagttaat cttaccattg tcaatgtcag ccaaattcaa aagggttgga ttaaagggca 30060 
atctgttagt gttagcaata gtgggtacta tacccagcat actcttatct atgacattat 30120 
agttatacca ctgcctacac ctagcccacc tagcactacc acacagacaa cccacactac 30180 
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ctcctttgtg 


actitaccGcb 


yw V wwyacc w 
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31800 


ccccttatcc agtgaaataa agatcatatt gatgatttga gtttaataaa aataaagaat 31860 

cacttacttg aaatctgata ccaggtctct gtccatgttt tctgccaaca ccagttcaGt 31920 

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cccctcttcc cagctctggt actgcaggcc ccggcgggct gcaaacttcc tccacaccct 31980 

gaaggggatg tcaaattcct cptgtccctc aatcttcatt ttatcttcta tcagatgtcc 32040 

aaaaagcgcg tccgggtgga tgatgacttc gaccccgtct acccctacga tgcagacaac 32100 

gcaccgaccg tgcccttcat caaccccccc ttcgtctctt cagatggatt ccaagagaag 32160 

cccctggggg tgctgtccct gcgtctggcc gatcccgtca ccaccaagaa cggggaaatc 32220 

accctcaagc tgggagatgg ggtggacctc gactcctcgg gaaaactcat ctccaacacg 32280 

gccaccaagg ccgccgcccc tctcagtttt tccaacaaca ccatttccct taacatggat 32340 

accccttttt acaacaacaa tggaaagttia ggcatgaaag tcactgctcc actgaagata 32400 

ctagacacag acttgctaaa aacacttgtt gtagcttatg gacaaggttt aggaacaaac 32460 

accactggtg cccttgttgc ccaactagca tccccacttg cttttgatag caatagcaaa 32520 

attgccctta atttaggcaa tggaccattg aaagtggatg caaatagact gaacatcaat 32580 

tgcaatagag gactctatgt tactaccaca aaagatgcac tggaagccaa tataagttgg 32640 

gctaatgcta tgacatttat aggaaatgcc atgggtgtca atattgatac acaaaaaggc 32700 

ttgcaatttg gcaccactag taccgtcgca gatgttaaaa acgcttaccc catacaaatc 32760 

aaacttggag ctggtctcac atttgacagc acaggtgcaa ttgttgcatg paacaaagat 32820 

gatgacaagc ttacacbatg gaccacagcc gacccctctc caaattgtca catatattct 32880 

gaaaaggatg ctaagcttac actttgcttg acaaagtgtg gcagtcagat tctgggcact 32940 

gtttccctca tagctgttga taqtggcagt ttaaatccca taacaggaac agtaaccact 33000 

gctcttgtct cacttaaatt cgatgcaaat ggagttttgc aaagcagctc aacactagac 33060 

tcagactatt ggaatttcag acagggagat gttacacctg ctgaagccta tactaatgct 33120 

ataggtttca tgcccaatct aaaagcatac cctaaaaaca caagtggagc tgcaaaaagt 33180 

cacattgttg ggaaagtgta cctacatggg gatacaggca aaccactgga cctx:attatt 33240 

actttcaatg aaacaagtga tgaajtcttgc acttactgta ttaactttca atggcagtgg 33300 

ggggctgatc aatataaaaa tgaaacactt gccgtcagtt cattcacctt ttcctatatt 33360 

gctaaagaat aaaccccact ctgtacccca tctctgtcta tggaaaaaac tctgaaacac 33420 

aaaataaaat aaagttcaag tgttttattg attcaacagt tttacaggat tcgagcagtt 33480 

atttttcctc caccctccca ggacatggaa tacaccaccc tctccccccg cacagccttg 33540 

aacatctgaa tgccattggt gatggacatg cttttggtct ccacgttcca cacagtttca 33600 



gagcgagcca gtctcgggtc ggtcdgggag atgaaaccct eegggcaetc ccgcatctgc 33660 

acctcacagc tcaacagctg aggattgtec tcggtggtcg ggatcacggt ta'tctggaag 33720 

aagcagaaga gcggcggtgg gaatcatagt ccgcgaacgg gatcggeegg tggtgtcgca 33780 

tcaggccccg cagcagtcge tgtcgccgec gctccgtcaa gctgotgctc agggggcccg 33840 

ggtccaggga ctccctcagc atgatgccea cggccctcag catcagtcgt ctggtgcgge 33900 

gggcgcagca gcgcatgcgg atctcgctca ggtcgctgca gtacgtgcaa eaeaggacca 33960 

ccaggfctgtt caacagtcca ta^ttcaaea cgctccagcc gaaactcatc gcgggaagga 34020 

tgctacccac gtggccgtcg taccagatcc tcaggtaaat caagtggcgc clccetceaga 34080 

acacgcegcc catgtacatg atctccttgg gcatgtggcg gttcaccacc tcccggtacc 34140 

acatcaecct ctggttgaac atgcagcccc ggatgatcct gcggaaccac agggocagca 34200 

ccgccccgcc cgccatgcag cgaagagacc ccgggtcccg acaatggcaa t&gaggacco 34260 

accgctcgta cccgtggatc atctgggagc tgaacaagtc tatgttggca cagcacaggc 34320 

atatgctcat gcatctcttc agcactctca gctcctcggg ggtcaaaaco atatcooagg 34380 

gcacggggaa ctcttgcagg acagcgaacc ccgcagaaca gggcaatcct cgcapataac 34440 

ttacattgtg catggacagg gtatcgcaat caggcagcac cgggtgatcc tccaccagag 34500 

aagcgcgggt ctcggtctcc tcacagcgtg gtaagggggc cggccgatac gggcgatggc 34560 

gggacgcggc tgatcgtgtt cgcgaccgtg tCatgatgca gttgctttcg gkcattttcg 34620 

tacttgctgt agcagaacct ggcccgggcg ctgcacaccg atcgccggcg gcggteccgg 34680 

cgcttggaac gctcggcgtt gaagttgtaa aacagccact ctctcagacc gtgckgcaga 34740 

tctagggcct caggagbgat gaagatccoa tcatgcctga tggctctaat eacatcgacc 34800 

accgtggaat gggccagaoc cagccagatg atgcaatttt gttgggtttc ggtgacggcg 34860 

ggggagggaa gaacaggaag aaccatgatt aacttttaat ceaaacggtc tcggagcact 34920 

ccaaaatgaa gatcgcggag atggcacctc tcgccccegc tgtgttggtg gaaaataaca 34980 

gccaggtcaa aggtgatacg gttctcgaga tgttccacgg tggcttccag caaagcctcc 35040 

acgcgeacat ccagaaacaa gaeaatageg aaagcgggag ggttctctaa ttcctcaatc 35100 

atcatgttac actcctgcac catccceaga taattttcat ttttccagcc tfcgaatgatt 35160 

cgaactagtt cctgaggtaa atccaagcca gccatgataa agagctcgcg cagagcgccc 35220 

tccaccggca ttcttaagca cacectcata attccaagat attctgctcc tggttcacct 35280 
gcagcagatt gacaagcgga atatcaaaat ctctgccgcg atccctaagc tcctccctca 35340 
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gcaataactg taagtactct ttcatatcct ctccgaaatt tttagccata ggaccaccag 35400 

gaataagatt agggcaagcc acagtacaga taaaccgaag tcctccccag tgagcattgc 35460 

caaatgcaag actgctataa gca^tgctggc tagacccggt gatatcttcc agataactgg 35520 

acagaaaatc gcccaggcaa tttttaagaa aatcaacaaa agkaaaatcc tccaggtgca 35580 

cgtttagagc ctcgggaaca acgatggagt aaatgcaagc ggtgcgttcc agcatggtta 35640 

gttagctgat ctgtagaaaa aaacaaaaat gaacattaaa ccatgctagc ctggcgaaca 35700 

ggtgggtaaa tcgttctctc cagcaccagg caggccacgg ggtctccggc acgaccctcg 35760 

taaaaatcgt cgctatgatt gaaaaccatc acagagagac gttcccggtg gccggcgtga 35820 

atgattcgac aagatgaata cacccccgga acattggcgt ccgcgagtga aaaaaagcgc 35880 

ccaaggaagc aataaggcac tacaacgctc agtctcaagt ccagcaaagc gatgccatgc 35940 

ggatgaagca caaaattctc aggtgcgtac aaaatgtaat tactcccctc ctgcacaggc 36000 

agcaaagccc ccgatccctc caiggtacaca tacaaagcct cagcgtccat agcttaccga 36060 

gcagcagcac acaacaggcg caagagtcag agaaaggctg agctctaacc tgtccacccg 36120 

ctctctgctc aatatatagc ccagatctac actgacgtaa aggccaaagt ctaaaaatac 36180 

ccgccaaata atcacacacg cccagcacac gcccagaaac cggtgacaca ctcaaaaaaa 36240 

tacgcgcact tcctcaaacg cccaaactgc cgtcatttcc gggttcccac gctacgtcat 36300 

caaaattcga ctttcaaatt ccgtcgaccg ttaaaaacgt cgccogcccc gcccctaacg 36360 

gtcgccgctc ccgcagccaa tcaccgcccc gcatccccaa attcaaatac ctcatttgca 36420 

tattaacgcg caccaaaagi ttgaggtata ttattgatga tg 36462 

<210> 2 
<211> 36604 
<212> DNA 

<213> Chimpanzee adenovirus serotype Pan6 
<400> 2 

catcatcaat aatatacctc aaacttttgg tgcgcgttaa tatgcaaatg agctgtttga 60 
atttggggag ggaggaaggt gattggctgc gggagcggcg accgttaggg gcggggcggg 120 
tgacgttttg atgacgtggc tatgaggogg agccggtttg caagttctcg tgggaaaagt 180 
gacgtcaaac gaggtgtggt ttgaacacgg aaatactcaa ttttcccgcg ctctctgaca 240 
ggaaatgagg tgtttctggg cggatgcaag tgaaaacggg ccattttcgc gcgaaaactg 300 
aatgaggaag tgaaaatctg agtaatttcg cgtttatggc agggaggagt atttgccgag 360 




ggccgagtag actttgaccg attacgtggg ggtttcgatt accgtatttt tcacptaaat 420 

ttccgcgtac ggtgtcaaag tccgg^'gttt ttacgtaggc gtcagctgat cgccagggta 480 

tttaaacctg cgctctctag tcaagaggcc actcttgagt gccagcgagt agagttttct 540 

cctccgcgcc gcgagtcaga tctacacttt gaaagatgag gcacctgaga gacctgcccg 600 

gtaatgtttt cctggctact gggaacgaga ttctggaatt ggtggtggac gccatgatgg 660 

gtgacgaccc tccagagccc cctaccccat ttgaggcgcc ttcgctgtac gatttgtatg 720 

atctggaggt ggatgtgccc gagagcgacc cbaacgagga ggcggtgaat gatttgttta 780 

gcgatgccgc gctgctggct gccgagcagg ctaatacgga ctctggctca gacagcgatt 840 

cctctctcca taccccgaga cccggcagag gtgagaaaaa gatccccgag cttaaagggg 900 

aagagctcga cctgcgctgc tatgaggaat gcttgcctcc gagcgatgat gaggaggacg 960 

aggaggcgat tcgagctgcg gtgaaccagg gagtgaaaac tgcgggcgag agctttagcc 1020 

tggactgtcc tactctgccc ggacacggct gtaagtcttg tgaatttcat cgcatgaata 1080 

ctggagataa gaatgtgatg tgtgccctgt gctatatgag agcttacaac cattgtgttt 1140 

acagtaagtg tgattaactt tagttgggaa ggcagagggt gactgggtgc tgactggttt 1200 

atttatgtat atgttttttt atgtgtaggt cccgtctctg acgtagatga gacccccact 1260 

tcagagtgca tttcatcacc cccagaaatt ggcgaggaac cgcccgaaga tattattcat 1320 

agaccagttg cagtgagagt caccgggcgg agagcagctg tggagagttt ggatgacttg 1380 

ctacagggtg gggatgaacc tttggacttg tgtacccgga aacgccccag gcactaagtg 1440 

ccacacatgt gtgtttactt aaggtgatgt cagtatttat agggtgtgga gtgcaataaa 1500 

atccgtgttg actttaagtg cgtgttttat gactcagggg tggggactgt gggtatataa 1560 

gcaggtgcag acctgtgcgg tcagttcaga gcaggactca tggagatctg gactgtcttg 1620 

gaagactttc accagactag acagttgcta gagaactcat cggagggagt ctcttacctg 1680 

tggagattct gcttcggtgg gcctctagct aagctagtct atagggccaa acaggattat 1740 

aaggaacaat ttgaggatat tttgagagag tgtcctggta tttttgactc tctcaacttg 1800 

ggccatcagt ctcactttaa ccagagtatt ctgagagccc ttgacttttc tactcctggc 1860 

agaactaccg ccgcggtagc cttttttgcc tttattcttg acaaatggag tcaagaaacc 1920 

catttcagca gggattaccg tctggactgc ttagcagtag ctttgtggag aacatggagg 1980 

tgccagcgcc tgaatgcaat ctccggctac ttgccagtac agccggtaga cacgctgagg 2040 
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atcctaagtc tccagtcacc ccaggaacac caacgccgcc agcagccgca gckggagcag 2100 

cagcaagagg aggaccgaga agagaacccg agagccggtc tggaccctcc ggtggcggag 2160 

gaggaggagt agctgacttg tttcccgagc tgcgccgggt gctgactagg tcttccagtg 2220 

gacgggagag ggggattaag cgggagaggc atgaggagac tagccacaga actgaactga 2280 

ctgtcagtct gatgagccgc aggfcgcccag aatcggtgtg gtggcatgag .gtgcagtcgc 2340 

aggggataga tgaggtctcg gtgatgcatg agaaatattc cctagaacaa gtcaagactt 2400 

gttggttgga gcccgaggat gattgggagg tagccatcag gaattatgcc aagctggctc 2460 

tgaagccaga caagaagtac aagattacca aactgattaa tatcagaaat tcctgctaca 2520 

tttcagggaa tggggccgag gtggagatca gtacccagcfa gagggtggcc ttcagatgtt 2580

gtatgatgaa tatgtacccg ggggtggtgg gcatggaggg agtcaccttt atgaacacga 2640 

ggttcagggg tgatgggtat aatggggtgg tctttatggc caacaccaag ctgacagtgc 2700 

acggatgctc cttctttggc ttcaataaca tgtgcatcga ggcctggggc agtgtttcag 2760 

tgaggggatg cagcttttca gccaactgga tgggggtcgt gggcagaacc aagagcaagg 2820 

tgtcagtgaa gaaatgcctg ttcgagaggt gccacctggg ggtgatgagc gagggcgaag 2880 

ccaaagtcaa acactgcgcc tctaccgaga cgggctgctt tgtgctgatc aagggcaatg 2940

cccaagtcaa gcataacatg atctgtgggg cctcggatga gcgcggctac cagatgctga 3000 

cctgcgccgg tgggaacagc catatgctgg ccaccgtgca tgtggcctcg cacccccgca 3060 

agacatggcc cgagttcgag cacaacgtca tgacccgctg caatgtgcac ctgggctccc 3120 

gccgaggcat gttcatgccc taccagtgca acacgcaatt tgtgaaggtg ctgctggagc 3180 

ccgatgccat gtccagagtg agcctgacgg gggtgtttga catgaatgtg gagctgtgga 3240 

aaattctgag atatgatgaa tccaagacca ggtgccgggc ctgcgaatgc ggaggcaagc 3300 

acgccaggct tcagcccgtg tgtgtggagg tgacggagga cctgcgaccc gatcatttgg 3360 

tgttgtcctg caacgggacg gagttcggct ccagcgggga agaatctgac tagagtgagt 3420 

agtgtttggg gctgggtgtg agcctgcatg aggggcagaa tgactaaaat ctgtggtttt 3480 

ctgtgtgttg cagcagcatg agcggaagcg cctcctttga gggaggggta ttcagccctt 3540 

atctgacggg gcgtctcccc tcctgggcgg gagtgcgtca gaatgtgatg ggatccacgg 3600 

tggacggccg gcccgtgcag cqcgcgaact cttcaaccct gacctacgcg accctgagct 3660 

cctcgtccgt ggacgcagct gccgccgcag ctgctgcttc cgccgccagc gccgtgcgcg 3720 

gaatggccct gggcgccggc tactacagct ctctggtggc caactcgagt tccaccaata 3780

atCGcgccag cctgaacgag gagaagctgc tgctgctgat ggcccagctc gaggccctga 3840

cccagcgcct gggcgagctg acccagcagg tggctcagct gcaggcggag acgcgggccg 3900 

cggttgccac ggtgaaaacc aaataaaaaa tgaatcaata aataaacgga gacggttgtt 3960 

gattttaaca cagagtcttg aatctttatt tgatttttcg cgegcggtiag gccctggacc 4020

accggtctcg atcattgagc acccggtgga tcttttccag gacccggtag aggtgggctt 4080

ggatgttgag gtacatgggc atgagcccgt cccgggggtg gaggtagctc cattgcaggg 4140 

cctcgtgctc ggggatggtg ttgtaaatca cccagtcata gcaggggcgc agggcgtggt 4200 

gctgcacgat gtccttgagg aggagactga tggccacggg cagccccttg gtgtaggtgt 4260 

tgacgaacct gttgagctgg gagggatgca tgcgggggga gatgagatgc atcttggcct 4320 

ggatcttgag attggcgatg ttcccgccca gatcccgccg ggggttcatg ttgtgcagga 4380 

ccaccagcac ggtgtatccg gtgcacttgg ggaatttgtc atgcaacttg gaagggaagg 4440 

cgtgaaagaa tttggagacg CQct^tgtgac cgcccaggtt ttccatgcac tcaticcatga 4500 

tgatggcgat gggcccgtgg gcggcggcct gggcaaagac gttccggggg tcggacacat 4560

cgtagttgtg gtcctgggtg agctcgtcat aggccatttt aatgaatttg gggcggaggg 4620 

tgcccgactg ggggacgaag gtgccctcga tcccgggggc gtagttgccc tcgcagatct 4680

gcatctccca ggccttgagc tcggaggggg ggatcatgtc cacchgcggg gggatgaaaa 4740 

aaacggtttc cggggcgggg gagatgagct gggccgaaag caggttccgg agcagctggg 4800 

acttgccgca accggtgggg cc^tagatga ccccgatgac cggctgcagg tggtagttga 4860 

gggagagaca gctgccgtcc tcgcggagga ggggggccac ctcgttcatc atctcgcgca 4920 

catgcatgtt ctcgcgcacg agttccgcca ggaggcgctc gccccccagc gagaggagct 4980

cttgcagcga ggcgaagttt ttcagcggct tgagtccgtc ggccatgggc attttggaga 5040 

gggtctgttg caagagttcc agacggtccc agagctcggt gatgtgctct agggcatctc 5100 

gatccagcag acctcctcgt ttcgcgggtt ggggcgactg cgggagtagg gcaccaggcg 5160 

atgggcgtCG agcgaggcca gggtccggtc cttccagggc cgcagggtcc gcgtcagcgt 5220

ggtctccgtc acggtgaagg g^tgcgcgcc gggctgggcg cttgcgaggg tgbgcttcag 5280 

gctcaticcgg ctggtcgaga accgctcccg gtcggcgccc tgcgcgtcgg ccaggtagca 5340 

attgagcatg agttcgtagt tgagcgcctc ggccgcgtgg cccttggcgc ggagcttacc 5400 

tttggaagtg tgtccgcaga cgggacagag gagggacttg agggcgtaga gcttgggggc 5460

gaggaagacg gactcggggg cgtaggcgtc cgcgccgcag ctggcgcaga cggtctcgca 5520

ctccacgagc caggtgaggt cggggcggtt ggggtcaaaa acgaggtttc ctccgtgctt 5580 

tttgatgcgt ttcttacctc tggtctccat gagctcgtgt ccccgctggg tgacaaagag 5640 

gctgtccgtg tccccgtaga ccgdctttat gggccggtcc tcgagcgggg tgccgcggtc 5700 

ctcgtcgtag aggaaccccg cccactccga gacgaaggcc cgggtccagg ccagcacgaa 5760 

ggaggccacg tgggaggggt agcggtcgtt gtccaccagc gggtccacct tqtccagggt 5820 

atgcaagcac atgtccccct cgtccacatc caggaaggtg attggcttgt aagtgtaggc 5880 

cacgtgaccg ggggtcccgg ccgggggggt ataaaagggg gcgggcccct gpMgtcctc 5940 

actgtcttcc ggatcgctgt ccaggagcgc cagctgttgg ggtaggtatt ccctctcgaa 6000 

ggcgggcatg acctcggcac tcaggttgtc agtttctaga aacgaggagg atttgatatt 6060 

gacggtgccg ttggagacgc ctttcatgag cccctcgtcc atttggtcag aaaagacgat 6120 

ctttttgctg tcgagcttgg tggcgaagga gccgtagagg gcgttggaga gcagcttggc 6180 

gatggagcgc atggtctggt tcttttcctt gtcggcgcgc tccttggcgg cgatgttgag 6240 

ctgcacgtac tcgcgcgcca cgcacttcca ttcggggaag acggtggtga gctcgtcggg 6300 

cacgattctg acccgccagc cgcggttgtg cagggtgatg aggtccacgc tggtggccac 6360

ctcgccgcgc aggggctcgt tggtccagca gaggcgcccg cccttgcgcg agcagaaggg 6420

gggcagcggg tccagcatga gctcgtcggg ggggtcggcg tccacggtga agatgccggg 6480 

caggagctcg gggtcgaagt agctgatgca ggtgcccaga ttgtccagcg ccgcttgcca 6540 

gtcgcgcacg gccagcgcgc gctcgtaggg gctgaggggc gtgccccagg gcatggggtg 6600 

cgtgagcgcg gaggcgtaca tgccgcagat gtcgtagacg tagaggggct cctcgaggac 6660 

gccgatgtag gtggggtagc agcgcccccc gcggatgctg gcgcgcacgt agtcgtacag 6720 

ctcgtgcgag ggcgcgagga gccccgtgcc gaggttggag cgttgcggct tttcggcgcg 6780 

gtagacgatc tggcggaaga tggcgtggga gttggaggag atggtgggcc tttggaagat 6840 

gttgaagtgg gcgtggggca ggccgaccga gbccctgatg aagtgggcgt aggagtcctg 6900 

cagcttggcg acgagctcgg cggtgacgag gacgtccagg gcgcagtagt cgagggtctc 6960 

ttggatgatg tcatacttga gctggccctt ctgcttccac agctcgcggt tgagaaggaa 7020 

ctcttcgcgg tccttccagt actcttcgag ggggaacccg tcctgatcgg cacggtaaga 7080 

gcccaccatg tagaactggt tgacggcctt gtaggcgcag cagcccttct ccacggggag 7140 

ggcgtaagct tgcgcggcct tgcgcaggga ggtgtgggtg agggcgaagg tgtcgcgcac 7200

catgaccttg aggaactggt gcfctgaagtc gaggtcgtcg cagccgccct gctcccagag 7260

ttggaagtcc gtgcgcttct tgtaggcggg gttaggcaaa gcgaaagtaa catcgttgaa 7320

gaggatcttg cccgcgcggg gc^tgaagtt gcgagtgatg cggaaaggct ggggcacctc 7380

ggcccggttg ttgatgacct gggcggcgag gacgatctcg tcgaagccgt tgatgttgtg 7440

cccgacgatg tagagttcca cgaatcgcgg gcggcccttg acgtggggca gcttcttgag 7500

Gtcgtcgtag gtgagctcgg cggggtcgct gagcccgtgc tgctcgaggg cccagtcggc 7560

gacgtggggg ttggcgctga ggaaggaagt ccagagatcc acggccaggg cggtctgcaa 7620

gcggtcccgg tactgacgga actgttggcc cacggccatt ttttcggggg tgacgcagta 7680

gaaggtgcgg gggtcgccgt gccaigcggtc ccacttgagc t99SLgggcga ggtcgtgggc 7740

gagctcgacg agcggcgggt ccccggagag tttcatgacc agcatgaagg ggacgagctg 7800

cttgccgaag gaccccatcc aggtgtaggt ttccacatcg taggtgagga agagcctttc 7860

ggtgcgagga t^9cgagccga tggggaagaa ctggatctcc tgccaccagt tggaggaatg 7920

gctgttgatg tgatggaagt agaaatgccg acggcgcgcc gagcactcgt gcttgtgttt 7980

atacaagcgt ccgcagtgct cgcaacgctg cacgggatgc acgtgctgca cgagctgtac 8040

ctgggttcct ttggcgagga atttcagtgg gcagtggagc gctggcggct gcatctcgtg 8100

ctgtactacg tcttiggccat cggcgtggcc atcgtctgcc tcgacggtgg tcatgctgac 8160

gagcccgcgc gggaggcagg tccagacctc ggctcggacg ggtcggagag cgaggacgag 8220

ggcgcgcagg ccggagctgt ccagggtcct gagacgctgc ggagtcaggt cagtgggcag 8280

cggcggcgcg cggttgactt gcaggagctt ttccagggcg cgcgggaggt ccagatggta 8340

cttgatctcc acggcgccgt tggtggctac gtccacggct tgcagggtgc cgtgcccctg 8400

gggcgccacc accgtgcccc gtttcttctt gggcgctgct tccatgtcgg tcagaagcgg 8460

cggcgaggac gcgcgccggg cggcaggggc ggctcggggc ccggaggcag gggcggcagg 8520

ggcacgtcgg cgccgcgcgc gggcaggttc tggtactgcg cccggagaag actggcgcga 8580

gcgacgacgc gacggttgac gtcctggatc tgacgcctct BSgtgaaggc caqgggaccc 8640

gtgagtttga acctgaaaga gagttcgaca gaatcaatct cggl:atcgtt gacggcggcc 8700

tgccgcagga tctcttgcac gtcgcccgag ttgtcctggt aggcgatctc ggtcatgaac 8760

tgctcgatct cctcctcctg aaggfectccg cggccggcgc gcccgacggt ggccgcgagg 8820

tcgttggaga tgcggcccat gagctgcgag aaggcgttca tgccggcctc gttccagacg 8880

cggctgtaga ccacggctcc gtcggggtcg cgcgcgcgca tgaccacctg g&cgaggttg 8940

agctcgacgt ggcgcgtgaa gaccgcgtag ttgcagaggc gctggtagag gtagttgagc 9000 

gtggtggcga tgtgctcggt gacgaagaag tacatgatcc agcggcggag cggpatctcg 9060 

ctgacgtcgo ccagggcttc caagcgttcc atggcctcgt agaagtecac ^cgaagttg 9120 

aaaaactggg agttgcgcgo cgagacggtc aactcctcct ccagaagacg gatgagctcg 9180 

gcgatggtgg cgcgcacctc gcgctcgaag gccccggggg gctcctctto catctectce 9240 

tcttcctcct ccactaacat ctcttctact tcctcctcag gaggcggtgg cggg^gaggg 9300 

gccccgcgte gccggcggcg cacg§gcaga cggtcgatga agcgctcgat ggtctccccg 9360 

cgccggcgac gcatggtcte ggtgai^ggcg cgcccgtcct cgeggggecg cagcatgaag 9420 

acgccgccgc gcabotocag gtggccgccg ggggggtcte egttgggeag ggagagggcg 9480 

etgacgatgc atcttatcaa ttgacccgta gggactcegc gcaaggaoct gagcgtcccg 9540 

agatccacgg gatccgaaaa ccgctgaacg aaggcttcga gccagtcgca gtcgcaaggt 9600 

aggctgagcc cggtttcttg ttcttcgggt atttggtcgg gaggcgggcg ggcgatgctg 9660

ctggtgatga agttgaagta ggcggtcctg agacggcgga tggtggcgag gagcaccagg 9720 

tccttgggcc cggcttgctg gatgcgcaga cggtcggcca tgccccaggc gtggtcctga 9780 

cacctggcga ggtccttgta gtagtcctgc atgagccgct ccacgggcac ctcctcctcg 9840 

cccgcgcggc cgtgcatgcg cgtgagcccg aaccogcgct gcggctggac gagcgccagg 9900 

tcggcgacga cgcgctcggt gaggatggcc tgctggatct gggtgagggc ggtctggaag 9960 

tcgtcgaagt cgacgaagcg gtggpaggct ccggtgttga tggtgtagga gcaqttggcc 10020 

atgacggacc agttgacggt ctggtggccg ggt'cgcac^ga gctcgtggta ctj:gaggcgc 10080 

gagtaggcge gcgtgtcgaa gatgtagtcg ttgcaggcgc gcacgaggta ctggtatccg 10140 

acgaggaagt gcggcggcgg ctggcggtag agcggccatc gctcggtggc gggggcgccg 10200 

ggcgcgaggt cctcgagcat gaggcggtgg tagccgtaga tgtacctgga eatc^caggtg 10260 

atgccggcgg eggtggtgga ggcgcgcggg aactcgegga cgcggtteca gatg^tgcgc 10320 

agcggcagga agtagttcat ggtggccgeg gtctggoccg tgaggcgcgc gcagtcgtgg 10380

atgctctaga catacgggca aaaacgaaag oggtcagcgg ctcgactccg tggcctggag 10440 

gctaagcgaa cgggttgggc tgcgcgtgta ccccggttcg aatctcgaat caggbtggag 10500

ccgcagctaa ogbggtactg gcactcccgt ctcgacccaa gcctgctaac gaaacctcca 10560 

ggatacggag gcgggtcgtt ttttggcctt ggtcgctggt catgaaaaac tagtaagcgc 10620

ggaaagcggc cgcccgcgat ggctcgctgc cgtagtctgg agaaagaatc gccagggttg 10680

cgttgcggtg tgccccggtt cgagcctcag cgctcggcgc cggccggatt ccgcggctaa 10740 

cgtgggcgtg gctgccccgt cgtttccaag accccttagc cagccgactt ctccagttac 10800 

ggagcgagcc cctctttttt tttcttgtgt ttttgcpaga tgcatcccgt actgcggcag 10860 

atgcgccccc accctccacc acaaccgccc ctaccgcagc agcagcaaca gccggcgctt 10920 

ctgcccccgc cccagcagca gccagccact accgcggcgg ccgccgtgag cggagccggc 10980 

gttcagtatg acctggcctt ggaagagggc gaggggctgg cgcggctggg ggcgtcgtcg 11040 

ccggagcggc acccgcgcgt gcagatgaaa agggacgctc gcgaggccta cgtgcccaag 11100 

cagaacctgt tcagagacag gagcggcgag gagcccgagg agatgcgcgc ctcccgcttc 11160 

cacgcggggc gggagctgcg gcgcggcctg gaccgaaagc gggtgctgag ggacgaggat 11220 

ttcgaggcgg acgagctgac ggggatcagc cccgcgcgcg cgcacgtggc cgcggccaac 11280 

ctggtcacgg cgtacgagca gaccgtgaag gaggagagca acttccaaaa atccttcaac 11340 

aaccacgtgc gcacgctgat cgcgcgcgag gaggtgaccc tgggcctgat gcacctgtgg 11400 

gacctgctgg aggccatcgt gcagaacccc acgagcaagc cgctgacggc gcagctgttt 11460 

ctggtggtgc agcacagtcg ggacaacgag acgttcaggg aggcgctgct gaatatcacc 11520 

gagcccgagg gccgctggct cctggacctg gtgaacattt tgcagagcat cgtggtgcag 11580 

gagcgcgggc tgccgctgtc cgagaagctg gcggccatca acttctcggt gptgagtctg 11640 

ggcaagtact acgctaggaa gatctacaag accccgtacg tgcccataga caaggaggtg 11700 

aagatcgacg gigttttacat gcgcatgacc ctgaaagtgc tgaccctgag cgacgatctg 11760 

ggggtgtacc gcaacgacag gatgcaccgc gcggtgagcg ccagccgccg gcgcgagctg 11820 

agcgaccagg agctgatgca cagcctgcag cgggccctga ccggggccgg gaccgagggg 11880 

gagagctact ttgacatggg cgcggacctg cgctggcagc ccagccgccg ggccttggaa 11940 

gctgccggcg gttcccccta cgtggaggag gtggacgatg aggaggagga gggcgagtac 12000

ctggaagact gatggcgcga ccgtattttt gctagatgca gcaacagcca ccgccgccgc 12060 

ctcctgatcc cgcgatgcgg gcggcgctgc agagccagcc gtccggcatt aactcctcgg 12120 

acgattggac ccaggccatg caacgcatca tggcgetgac gacccgcaat cccgaagcct 12180

ttagacagca gcctcaggcc aaccggctct cggccatcct ggaggccgtg gtgccctcgc 12240 

gctcgaaccc cacgcacgag aaggtgctgg ccatcgtgaa cgcgctggtg gagaacaagg 12300

ccatccgcgg tgacgaggcc gggctggtgt acaacgcgct gctggagcgc gtggcccgct 12360

acaacagcac caacgtgcag acgaacctgg accgcatggt gaccgacgtg cgcgaggcgg 12420 

tgtcgcagcg cgagcggttc caccgcgagt cgaacctggg ctccatggtg gcgctgaacg 12480 

ccttcctgag cacgcagccc gccaacgtgc cccggggcca ggaggactac accaacttca 12540 

tcagcgcgct gcggctgatg gtggccgagg tgccccagag cgaggtgtac cagtcggggc 12600 

cggactactt cttccagacc as[tcgccagg gcttgcagac cgtgaacctg agccaggctt 12660 

tcaagaactt gcagggactg tggggcgtgc aggccccggt cggggaccgc gcgacggtgt 12720 

cgagcctgct gacgccgaac tcgcgcctgc tgctgctgct ggtggcgccc ttcacggaca 12780 

gcggcagcgt gagccgcgac tcgtacctgg gctacctgct taacctgtac cgcgaggcca 12840 

tcggacaggc gcacgtggac gagcagacct accaggagat cacccacgtg agccgcgcgc 12900 

tgggccagga ggacccgggc aacctggagg ccaccctgaa cttcctgctg accaaccggt 12960 

cgcagaagat cccgccccag tacgcgctga gcaccgagga ggagcgcatc ctgcgctacg 13020 

tgcagcagag cgtggggctg ttcctgatgc aggagggggc cacgcccagc gcggcgctcg 13080

acatgaccgc gcgcaacatg gagcccagca tgtacgcccg caaccgcccg ttcatcaata 13140 

agctgatgga ctacttgcat cgggcggccg ccatgaactc ggactacttt accaacgcca 13200 

tcttgaaccc gcactggctc ccgccgcccg ggttctacac gggcgagtac gacatgcccg 13260 

accccaacga cgggttcctg tgggacgacg tggacagcag cgtgctctcg qcgcgtccag 13320 

gaaccaatgc cgtgtggaag aaagagggcg gggaccggcg gccgtcctcg gcgctgtccg 13380 

gtcgcgcggg tgctgccgcg gcggtgcccg aggccgccag ccccttcccg agcctgccct 13440 

tttcgctgaa cagcgtgcgc agcagcgagc tgggtcggct gacgcgaccg cgcotgctgg 13500

gcgaggagga gtacctgaac gactccttgt tgaggcccga gcgcgagaag aacttcccca 13560 

ataacgggat agagagcctg gtggacaaga tgagccgctg gaagacgtac gegcacgagc 13620 

acagggacga gccccgagct agcagcgcag gcacccgtag acgccagcgg cacgacaggc 13680 

agcggggact ggtgtgggac gatgaggatt ccgccgacga cagcagcgtg ttggacttgg 13740 

gtgggagtgg tggtaacccg ttcgctcacc tgcgcccccg tatcgggcgc ctgatgtaag 13800 

aatctgaaaa aataaaagac ggtactcacc aaggccatgg cgaccagcgt gcgttcttct 13860
ctgttgtttg tagtagtatg atgaggcgcg tgtacccgga gggtcctcct ccctcgtacg 13920 
agagcgtgat gcagcaggcg gtggcggcgg cgatgcagcc cccgctggag gcgccttacg 13980 
tgcccccgcg gtacctggcg cctacggagg ggcggaacag cattcgttac tcggagctgg 14040
cacccttgta cgataccacc cggttgtacc tggtggacaa caagtcggca gacatcgcct 14100

cgctgaacta ccagaacgac cacagcaact tcctgaccac cgtggtgcag aacaacgatt 14160 

tcacccccac ggaggecagc acccagacca tcaactttga cgagcgctcg cggtggggcg 14220 

gccagcCgaa aaccatcatg cacacca^ca tgcccaacgt gaacgagttc atgtacagca 14280 

acaagttcaa ggcgcgggtg atggtctcgc gcaagacccc caacggggtg gatgatgatt 14340 

atgatggtag tcaggacgag ctgacctacg agtgggtgga gtttgagctg cccgagggca 14400 

acttctcggt gaccatgacc atcgatctga tgaacaacgc catcatcgac aactacttgg 14460 

cggtggggcg gcagaacggg gtgctggaga gcgacatcgg cgtgaagttc gacacgcgca 14520 

acttccggct gggctgggac cccgtgaccg agctggtgat gccgggcgtg tacaccaacg 14580 

aggccttcca ccccgacatc gtcctgctgc ccggctgcgg cgtggacttc accgagagcc 14640 

gcctcagcaa cctgctgggc atccgcaagc ggcagccctt ccaggagggc ttccagatcc 14700 

tgtacgagga cctggagggg ggcaacatcc ccgcgctctt ggatgtcgaa gcctacgaga 14760

aaagcaagga ggatagcacc gccgcggcga ccgcagccgt ggccaccgcc tqtaccgagg 14820 

tgcggggcga taattttgct agcgctgcgg cagcggccga ggcggctgaa accga?Lagta 14880 

agatagtcat ccagccggtg gagaaggaca gcaaggacag gagctacaac gtgctcgcgg 14940 

acaagaaaaa caccgcctac cgcagctggt acctggccta caactacggc gaccccgaga 15000 

agggcgtgcg ctcctggacg ctgctcacca cctcggacgt cacctgcggc gtggagcaag 15060

tctactggtc gctgcccgac atgatgcaag acccggtcac cttccgctcc acgcgtcaag 15120 

ttagcaiacta cccggtggtg ggcgccgagc tcctgcccgt ctactccaag agcttcttca 15180

acgagcaggc cgtctactcg cagcagctgc gcgccttcac ctcgctcacg cacgtcttca 15240 

accgcttccc cgagaaccag atcctcgtcc gcccgcccgc gcccaccatt accaccgtca 15300 

gtgaaaacgt tcctgcCctc acagatcacg ggaccctgcc gctgcgcagc agtatccggg 15360 

gagtccagcg cgtgaccgtc actgacgcca gacgccgcac ctgcccctac gtctacaagg 15420

ccctgggcgt agtcgcgccg cgcgtcctct cgagccgcac cttctaaaaa atgtccattc 15480

tcatctcgcc cagtaataac accggttggg gcctgcgcgc gcccagcaag atgtacggag 15540 

gcgctcgcca acgctccacg caacaccccg tgcgcgtgcg cgggcactitc cgcgctccct 15600 
ggggcgccct caagggccgc gtgcgctcgc gcaccaccgt cgacgacgtg atcgaccagg 15660 
tggtggccga cgcgcgcaac tacacgcccg ccgccgcgcc cgtctccacc gtggacgccg 15720

tcatcgacag cgtggtggcc gacgogcgcc ggtacgcccg caccaagagc cggcggcggc 15780

gcatcgcccg gcggcaccgg agcacccccg ccatgcgcgc ggcgcgagec ttgotgcgca 15840

gggccaggcg cacgggacgc agggccatgc tcagggcggc cagacgcgcg geetccggca 15900 

gcagcagcgc cggcaggacc cgcagacgcg cggccacggc ggcggcggcg gceatcgcca 15960 

gcatgtcccg ccegeggcgc ggcaacgtgt aetgggtgcg cgacgccgcc accggtgtgc 16020 

gcgtgeecgt gcgcaceegc cceccccgca cttgaagatg ctgacttcgc gatgttgatg 16080

tgtcccagcg gcgaggagga tgtceaagcg eaaatacaag gaagagatgc tcSoaggtcat 16140 

cgcgcctgag atetacggce ccgcggcggc ggtgaaggag gaaagaaagc cccgcaaact 16200 

gaagcgggtc aaaaaggaca aaaaggagga ggaagatgac sfgactggcgg agtttgtgcg 16260 

cgagttcgcc ccccggcggc gcgbgcagtg gcgcgggcgg aaagtgaaac cggtgctgcg 16320 

gcccggcacc acggtggtct tcacgccegg cgagcgttcc ggctccgcct ccaagcgctc 16380 

ctacgacgag gtgtacgggg acgaggacat octcgagcag gcggtcgagc gtctgggcga 16440 

gtttgcgtac ggcaagcgca gocgccccgc gcccttgaaa gaggaggcgg tgtccatccc 16500

gctggaccac ggcaacccca cgccgagcct gaagccggtg accctgcagc aggtgctacc 16560 

gagcgcggcg ccgcgccggg gcttcaagcg cgagggcggc gaggatctgt acccgaccat 16620 

gcagctgatg gtgcccaagc gccagaagct ggaggacgtg ctggagcaca tgaaggtgga 16680

ccccgaggtg cagcccgagg tcaaggtgcg gcccatcaag caggCggcce cgggcctggg 16740 

cgtgcagacc gtggaoatca agatccccac ggagcccatg gaaacgcaga ccgagcccgt 16800 

gaagcccagc accagcacca tggaggtgca gacggatccc tggatgccag caccagcttc 16860 

caccagcact cgccgaagac gdaagtacgg cgcggccagc ctgctgatgc ccaactacgc 16920 

gctgcatcct tccatcatce coacgccggg ctaccgcggc acgcgcttct accgcggcta 16980 

caccagcagc cgccgccgca agactaccac ccgccgccgt cgtcgcagcc gccgcagcag 17040 
caccgcgact tccgccttgg tgcggagagt gtaccgcagc gggcgcgagc ctctgaccct 17100

gccgcgcgcg cgctaccacc egagbatcgc catttaacta ccgcctccta cttgcagata 17160

tggccctoac atgeogccte cgcgtcccca tcacgggcta ecgaggaaga akgcogcgcc 17220 

gtagaaggcc gacggggaac gggctgcgtc gccatcacea ccggcggcgg c^cgccatca 17280 

gcaagcggte ggggggaggc ttcctgcccg cgctgatccc catcatcgcc gcggcgatcg 17340 

gggcgatccc cggcatagct tcfcgtggcgg tgcaggcctc tcagcgccac tgagacacaa 17400 

aaaagcatgg atttgtaata aaaadaaaaa tggactgacg ctcctggtcc tgtgatgtgt 17460
gtttttagat ggaagacatc aatttttcgt ccctggcacc gcgacacrggc acgcggccgt 17520
ttatgggcac ctggagcgac atcggcaaca gccaactgaa cgggggcgcc ttcadttgga 17580 
gcagtctctg gagcgggctt aagaatttcg ggtccacgct caaaacctat ggcfiacaagg 17640 
cgtggaacag cagpacaggg caggcgctga gggaaaagct gaaagaacag aacrttccagc 17700 
agaaggtggt tgatggcctg gcctcaggca tcaacggggt ggttgacctg gccaaccagg 17760 
ccgtgcagaa acagatcaac agccgcctgg acgcggtccc gcccgcgggg tccgtggaga 17820 
tgccccaggt ggaggaggag ctgcctcccc tggacaagcg cggcgacaag cgaccgcgtc 17880 
ccgacgcgga ggagacgctg ctgacgcaca cggacgagcc gcccccgtac gaggaggcgg 17940 
tgaaactggg cctgcccacc acgcggcccg tggcgcctct ggccaccgga gtgctgaaac 18000 
ccagcagcag ccagcccgcg accctggact tgcctccgcc tcgcccctcc acagtggcta 18060 
agcccctgcc gccggtggcc gtcgcgtcgc gcgccccccg aggccgcccc caggcgaact 18120 
ggcagagcac tctgaacagc ategtgggtc tgggagtgca gagtgtgaag cgccgcogct 18180 
gctattaaaa gacactgtag cgcttaactt gcttgtctgt gtgtatatgt atgtccgccg 18240 
accagaagga ggagtgtgaa gaggcgcgtc gccgagttgc aagatggcca ccccatcgat 18300 
gctgccccag tgggcgtaca tgcacatcgc cggacaggac gcttcggagt acctgagtcc 18360 
gggtctggtg cagttcgccc gcgccacaga cacctacttc agtctgggga acaagtttag 18420
gaaccccacg gtggcgccca cgcacgatgt gaccaccgac cgcagccagc ggctgacgct 18480 
gcgcttcgtg ccogbggacc gcgaggacaa cacctactcg tacaaagtgc gctacacgct 18540 
ggccgtgggc gacaaccgcg tgctggacat ggccagcacc tactttgaca tccgcggcgt 18600 
gctggaccgg ggccctagct tcaaacccta ctctggcacc gcctacaaca gcctagctcc 18660 
caagggagct cccaattcca gccagtggga gcaagcaaaa acaggcaatg ggggaactat 18720 
ggaaacacac acatatggtg tggccccaae gggcggagag aatattacaa aagatggtct 18780
tcaaattgga actgacgtta cagcgaatca gaataaacca atttatgccg acaaaacatt 18840
tcaaccagaa ccgcaagtag gagaagaaaa ttggcaagaa actgaaaact tttatggcgg 18900
tagagctctt aaaaaagaca caaacatgaa accttgctat ggctcctatg ctagacccac 18960 
caatgaaaaa ggaggtcaag ctaaacttaa agttggagat gatggagttc caaccaaaga 19020 
attcgacata gacctggctt tctttgatac tcccggtggc accgtgaacg gtca^gacga 19080 
gtataaagca gacattgtca tgti^taccga aaacacgtat ttggaaactc cagacacgca 19140

tgtggtatac aaaccaggca aggatgatgc aagttctgaa atlbaacctoo htea'te^aAi*^

tatgcccaac agacccaact acattgggtt cagggacaac

caacagcact ggcaatatgg gtgtgettgc tggtcaciqcc wwwcagcuga acgccgcggc 19320

tgatttgcaa gacagaaaca <;cgagctgbc ctaccagctc *-**y'-*»'-3«ct ccccgggcga 19380

cagaacccgg tatttcagta tgtggaacca os&ca&gacc ccgat^gcgcg 19440

catcatcgaa aaccatggtg tggaggatga abbQccaaac i*o u c u ucc wcccggacgg 19500

ctctggcact aacgccgcat accaaggtgt qaaaataaaa 3**^9g*-caag acggcgaugc 19560

tgagagtgaa tgggaaaatg acgatactgt tocaoic hcaa **vcaacuau gcaaaggcaa 19620

cattttcgcc atggagatta atctccaggc taacctgtgg cigaa^ccccc cccacccgaa 19680

cgtggccctg tacctgcccg actcctacaa dccaacgcca cgccgccgac 19740

caacaccaac acctacgatt acatgaatgg caaaotaar*a ccccccccgc tggtagacgc 19800

ctacctcaac atcggggcgc gctqqtCQCt Gfoacrfccat'a gacaacgtca acccctibcaa 19860

ccaccaccgc aacgcacTQcc ctgggcaaog ggcgctacgb 19920

gcccttccac atccaggtgc ^ t* 1* ^CTf^^a ^ aagagcctcc tgctcctgcc 19980

cgggtcctac acctacgagt cacisgeicguc aacatgatcc tgcagagctc 20040

cctaggcaac gacctgcgca •titcaccagca tcaacctcta 20100

cgccaccttc ttccccatgg CQcacaacac ctcgaggcca tgctgcgcaa 20160

cgacaccaac gaccagtcct ccccccggcg gccaacatgc tctaccccat 20220

cccggccaac gccaccaacg tQCccalic tc cgcaactggg ccgccttccg 20280

cggatggtcc ttcacgcgcc f*nanji tcgctcggct cqgggttcga 20340

cccctacttc gtctactcgg wuctccca^ac ggcaccttct acctcaacca 20400

caccttcaag aaggtctcca bcacctiticaa agctggcccg gcaacgaccg 20460

cctcctgacg ccc^acgagt ^w^cciccgbC gacggagagg gatadaacgt 20520

ggcccagtgc aacatigacca atgctggccc actacaacab 20580

cggcbaccag ggcttctacg tiQcccoiaaocr C!taoaaoaao cgcatgbacb ccbtcttccg 20640

caacttccag gcc^ggtcgt ggacgaggtc aacbacaagg actaccaggc 20700

cgtcaccctg gcctaccagc acaacaactc gggcttcgtc ggcbaccbcg cgcccaccab 20760

gcgccagggc cagccccacc ccgccaacta cccctacccg cbcatcggca agagcgccgb 20820

cgccagcgcc acccagaeuia agttcctctg cgaccgggtc atgtggcgca tccccbbcbc 20880

cagcaacttc atgtccatgg gcgcgctcac cgacctcggc cagaacatgc tctacgccaa 20940

ctccgcccap gcgctagaca tgaatttcga agtcgacccc atggatgagt ccacccttct 21000 

ctatgttgtc ttcgaagtct tcgacgtcgt ccgagtgcac cagccccacc gcggcgtcat 21060 

cgaagccgtc tacctgcgca cgcccttctc ggccggcaac gccaccacct aagccgctct 21120

tgcttcttgc aagatgacgg cgggctccgg cgagcaggag ctcagggcca tcctccgcga 21180 

cctgggctgc gggccctgct tcctgggcac cttcgacaag cgcttccctg gattcatggc 21240 

cccgcacaag ctggcctgcg ccatcgtgaa cacggccggc cgcgagaccg ggggcgagca 21300 

ctggctggcc ttcgcctgga acccgcgctc ccacacatgc tacctcttcg accccttcgg 21360 

gttctcggac gagcgcctca agcagatcta ccagttcgag tacgagggcc tgctgcgtcg 21420 

cagcgccctg gccaccgagg accgctgcgt caccctggaa aagtccaccc agaccgtgca 21480 

gggtccgcgc tcggccgcct gcgggctctt ctgctgcatg ttcctgcacg ccbtcgtgca 21540 

ctggcccgac cgccccatgg acaagaaccc caccatgaac ttaccgacgg gggtgcccaa 21600 

cggcatgctc cagtcgcccc aggtggaacc caccctgcgc cgcaaccagg aagcgctcta 21660 

ccgcttcctc aatgcccact ccgcctactt tcgctcccac cgcgcgcgca tcgag'aagge 21720 

caccgccttc gaccgcatga atcaagacat gtaaaaaacc ggtgtgtgta tgtgaatgct 21780 

ttattcataa taaacagcac atgtttatgc caccttctct gaggctctga ctttatttag 21840 

aaatcgaagg ggttctgccg gctctcggca tggcccgcgg gcagggatac gttgcggaac 21900 

tggtacttgg gcagccactt gaactcgggg atcagcagct tgggcacggg gaggtcgggg 21960 

aacgagtcgc tccacagctt gcgcgtgagt tgcagggcgc ccagcaggtc gggcgcggag 22020 

atcttgaaat cgcagttggg acccgcgttc tgcgcgcgag agttgcggta cacggggttg 22080 

cagcactgga acaccatcag ggccgggtgc ttcacgcttg ccagcaccgt cgcgtcggtg 22140 

atgccctcca cgtccagatc ctcggcgttg gccatcccga agggggtcat cttgcaggtc 22200 

tgccgcccca tgctgggcac gcagccgggc ttgtggttgc aatcgcagtg cagggggatc 22260 
agcatcatct gggcctgctc ggagctcatg cccgggtaca tggccttcat gaac^cctcc 22320

agctggcgga aggcctgctg cgccttgccg ccctcggtga agaagacccc gqaggacttg 22380 

ctagagaact ggttggtggc gcagqcggcg tcgtgcacgc agcagcgcgc gtcgttgttg 22440 

gccagctigca ccacgctgcg cccccagcgg ttctgggtga tcttggcccg gttggggttc 22500 

tccttcagcg cgcgctgccc gttctcgctc gccacatcca tctcgatagt gtgctccttc 22560 
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tggatcatca 


cggtcccgtg 


caggcaccgc 


agcttgccct 


cggcbtcggt 


ocaacca boc 




agccacagcg 


cgcagccggt 


gcactcccag 


ttcttgtggg 


cgabcbQQQa 
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acgaagccct 


gcaggaagcg 


gcccatcatc 


gcggtcaggg 
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QQ baaaaa t e 
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gcatcagctg 
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ttcaggtcgc 
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atcagcagcg 


tcatcacttc 


catgcccttc 


tcccaggccg 




***y g*2 ccagg 


22920 


gggttcttca 


ccgccattgt 
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cwggccgacg 


23100 
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23160 
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bbggccgbcg 
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cagccgcgcc 
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gccgcQca ta 




gccagcccaa 
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gagcccaacc 


cgcgcctcaa 
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ccbggccacc 
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gaacgc be bg 
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tg9£L9ttgga 
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gagcgccgtc 


abggaccagg 


bgctcatcaa 


gcgcgccbcg 


24300 






cccctctcgg aggaggagat gcagglicccc gagagttcgg acgagggcaa gcccgtggtc 24360 

agcgacgagc agctggcgcg ctggctggga gcgagtagca ccccccagag cctggaagag 24420 

cggcgcaagc tcatgatggc cgtggccctg gtgaccgtgg agctggagtg tctgcgccgc 24480 

ttctttgccg acgcggagac cctgcgcaag gtcgaggaga acctgcacta cctcttcagg 24540

cacgggttcg tgcgccaggc ctgcaagatc tccaacgtgg agctgaccaa cctggtctcc 24600 

tacatgggca tcctgcacga gaaccgcctg gggcaaaacg tgctgcacac caccctgcgc 24660

ggggaggccc gccgcgacta catccgcgac tgcgtctacc tgtacctctg ccacacctgg 24720 

cagacgggca tgggcgtgtg gcagcagtgc ctggaggagc agaacctgaa agagctctgc 24780 

aagctcctgc agaagaaccd caaggccctg tggaccgggt tcgacgagcg taccaccgcc 24840 

tcggacctgg ccgacctcat cttccccgag cgcctgcggc tgacgctgcg caacgggctg 24900 

cccgacttta tgagccaaag catgttgcaa aactttcgct ctttcatcct cgaacgctcc 24960 

gggatcctgc ccgccacctg ctccgcgctg ccctcggact tcgtgccgct gaccttccgc 25020

gagtgccccc cgccgctctg gagccactgc tacttgctgc gcctggccaa ctacctggcc 25080

taccactcgg acgtgatcga ggacgtcagc ggcgagggtc tgctggagtg ccactgccgc 25140 

tgcaacctct gcacgccgca ccgctccctg gcctgcaacc cccagctgct gagcgagacc 25200 

cagatcatcg gcaccttcga gttgcaaggc cccggcgacg gcgagggcaa ggggggtctg 25260 

aaactcaccc cggggctgtg gacctcggcc tacttgcgca agttcgtgcc cgkggactac 25320 

catcccttcg agatcaggtt ctacgaggac caatcccagc cgcccaaggc cgagctgtcg 25380 

gcctgcgtca tcacccaggg ggccatcctg gcccaattgc aagccatcca gaaatirccgc 25440 

caagaatttc tgctgaaaaa gggccacggg gtctacttgg acccccagac cggagaggag 25500 

ctcaacccca gcttccccca ggatgccccg aggaagcagc aagaagctga aagtggagct 25560 

gccgccgccg gaggatttgg aggaagactg ggagagcagt caggcagagg aggaggagat 25620 

ggaagactgg gacagcactc aggcagagga ggacagcctg caagacagtc tggagg^gga 25680

agacgaggtg gaggaggcag aggaagaagc agccgccgcc agaccgtcgt cctcggcgga 25740

gaaagcaagc agcacggata ccatctccgc tccgggtcgg ggtcgcggcg gccgggccca 25800 

cagtaggtgg gacgagaccg ggcgcttccc gaaccccacc acccagaccg gcaagaagga 25860 

gcggcaggga tacaagtcct ggcgggggca caaaaacgcc atcgtctcct gcttgcaagc 25920 

ctgcgggggc aacatctcct tcacdcggcg ctacctgctc ttccaccgcg gggtSraactt 25980
cccccgcaac atcttgcatt actaccgtca cctccacagc ccctactact gtttccaaga 26040
agaggcagaa acccagcagc agcagaaaac cagcggcagc agcagctaga aaatccacag 26100 
cggcggcagg tggactgagg atcgcggcga acgagccggc gcagacccgg gagctgagga 26160 
accggatctt tcccaccctc tatgccatct tccagcagag tcgggggcag gagcaggaac 26220 
tgaaagtcaa gaaccgttct ctgcgctcgc tcacccgcag ttgtctgtat cacaagagcg 26280 
aagaccaact tcagcgcact ctcgaggacg ccgaggctct cttcaacaag tactgcgcgc 26340 
tcactcttaa agagtagccc gcgcccgccc acacacggaa aaaggcggga attacgtcac 26400
cacctgcgcc cttcgcccga ccatcatgag caaagagatt cccacgcctt acatgtggag 26460 
ctaccagccc cagatgggcc tggccgccgg cgccgcccag gactactcca cccgcatgaa 26520 
ctggctcagt gccgggcccg cgatgctctc acgggtgaat gacatccgcg cccaccgaaa 26580
ccagatactc ctagaacagt ca^cc^atcac cgccacgccc cgccatcacc ttaatccgcg 26640
taattggccc gccgccctgg tgtaccagga aattccccag cccacgaccg tactacttcc 26700 
gcgagacgcc caggccgaag tccagctgac taactcaggt gtccagctgg ccggcggcgc 26760
cgccctgtgt cgtcaccgcc ccgctcaggg tataaagcgg ctggtgatcc gaggcfigagg 26820
cacacagctc aacgacgagg tggtgagctc ttcgctgggt ctgcgacctg acggagtctt 26880 
ccaactcgcc ggatpgggga gatcttcctt cacgcctcgt caggccgtcc tgactttgga 26940 
gagttcgtcc tcgcagcccc gctcgggcgg catcggcact ctccagttcg tggaggagtt 27000 
cactccctcg gtctacttca accccttctc cggctccccc ggccactacc cggacgagtt 27060 
catcccgaac ttcgacgcca tcagcgagtc ggtggacggc tacgattgaa tgtcccatgg 27120 
tggcgcagct gacctagctc ggcttcgaca cctggaccac tgccgccgct tccgctgctt 27180 
cgctcgggat ctcgccgagt ttgcctactt tgagctgccc gaggagcacc ctcagggccc 27240 
agcccacgga gtgcggatca tcgtcgaagg gggcctcgac tcccacctgc ttcggatctt 27300 
cagccagcga ccgatcctgg tcgagcgcga acaaggacag acccttctta ctttgtactg 27360 
catctgcaac caccccggcc tgcatgaaag tctttgttgt ctgctgtgta ctgagtataa 27420
taaaagctga gatcagcgac tactccggac togattgtgg tgttcccgct atcaaccggt 27480 
cccbgttctt caccgggaac gagaccgagc tccagctcca gtgtaagccc cacaagaagfc 27540 
acctcacctg gctgttccag ggctCcccga tcgccgttgt caaccactgc gacaacgacg 27600 
gagtcctgct gagcggccct gccaacctta ctttttccac ccgcagaagc aagctccagc 27660 
tcttccaacc cttcctcccc gggacctatc agtgcgtctc aggaccctgc catcacacct 27720
tccacctgat cccgaatacc acagcgccgc tccccgctac taacaaecaa actacccacc 27780

aacgccaccg tcgcgacctt tcctctgaat ctaataccac taccggaggt gagctccgag 27840 

gtcgaccaac ctctgggatt tactacggcc cctgggaggt ggtggggtta atagcgctag 27900 

gcctagttgc gggtgggctt ttggttctct gctacctata cctcccttgc tgttcgtact 27960 

tagtggtgct gtgttgctgg tttaagaaat ggggaagatc accctagtga gctgcggtgc 28020 

gctggtggcg gtgttgcttt cgattgtggg actgggcggc gcggctgtag tgaaggagaa 28080 

ggccgatccc tgcttgcatt tcaatcccaa caaatgccag ctgagttttc agcccgatgg 28140 

caatcggtgc gcggtactga tcaagtgcgg atgggaatgc gagaacgtga gaatcgagta 28200

caataacaag actcggaaca atactctcgc gtccgtgtgg cagcccgggg accccgagtg 28260 

gtacaccgtc tctgtccccg gtgctgacgg ctccccgcgc accgtgaata atactttcat 28320 

ttttgcgcac atgtgcaaca cggtcatgtg gatgagcaag cagtacgata tgtggccccc 28380 

cacgaaggag aacatcgtgg tcttctccat cgcttacagc ctgtgcacgg cgctaatcac 28440

cgctatcgtg tgcctgagca ttcacatgct catcgctatt cgccccagaa ataatgccga 28500 

gaaagagaaa cagccataac acgttttttc acacaccttg tttttacaga caatgcgtct 28560 

gttaaatttt ttaaacattg tgctcagtat tgcttatgcc tctggttatg caaacataca 28620

gaaaaccctt tatgtaggat ctgatiggtac actagagggt acccaatcac aagccaaggt 28680 

tgcatggtat ttttatagaa ccaacactga tccagttaaa ctttgtaagg gtgaattgcc 28740 

gcgtacacat aaaactccac ttacatttag ttgcagcaat aataatctta cacttttttc 28800

aattacaaaa caatatactg gtacfetatta cagtacaaac tttcatacag gacaagataa 28860 

atattatact gttaaggtag aaaatcctac cactcctaga actaccacca ccaccactac 28920 

tgcaaagccc actgtgaaaa ctacaactag gaccaccaca actacagaaa ccaccaccag 28980 

cacaacactt gctgcaacta cacacacaca caccaagcta accttacaga ccactaatga 29040 

tttgatcgcc ctgctgcaaa agggggataa cagcaccact tccaatgagg agatacccaa 29100 

atccatgatt ggcattattg ttgctgtagt ggtgtgcatg ttgatcatcg ccttgtgcat 29160 

ggtgtactat gccttctgct acagaaagca cagactgaac gacaagctgg aacactitact 29220 

aagtgttgaa ttttaatttt ttagaaccat gaagatccta ggccttttta gtttttctat 29280 

cattacctct gctctttgtg aatcagtgga tagagatgtt actattacca ctggttctaa 29340 

ttatacactg aaagggccac cctcagg.tat gctttcgtgg tattgctatt ttggaactga 29400 
cactgatcaa actgaattat gcakttttca aaaaggcaaa acctcaaaet ctaaaatcte 29460

taattatcaa tgcaatggca ctgatctgat actactcaat gtcacgaaag catatggtgg 29520 

cagttattat tgccctggac aaaacactga agaaatgatt ttttacaaag tggaagtggt 29580 

tgatcccact acaccaccca ccaccacaac tattcatacc acapacacag a^ckaacacc 29640 

agaggcaaca gaagcagagt tggccttcca ggttcacgga gattcetttg ctgtcaatac 29700 

ccetaeaccc gateagcggt gtceggggcc gctagtcagc ggcattgtcg gtgtgctttc 29760 

gggattagca gtcataatca tctgqatgtt catttttgct tgctgctata gaaggctcta 29820 

ccgacaaaaa tcagacceac tgetgaacct ctatgtttaa ttttttccag agccatgaag 29880 

gcagttagcg ctetagtttt ttgttctttg attggcattg tttttaatag taaaattacc 29940

agagttagct ttattaaaoa tgtjtaatgta actgaaggag ataacatcac actagcaggt 30000 

gtagaaggtg ctcaaaacac cacctggaca aaataccatc taggatggag agatatttgc 30060 

acctggaatg taacttatta ttgcatagga gttaatctta ccattgttaa cgctaaccaa 30120 

tctcagaatg ggttaattaa aggacagagt gttagtgtga ccagtgatgg gtactatacc 30180 

cagcatagtt ttaactacaa cattactgtc ataccactgc ctacgcctag cbcacctago 30240 

actaccacac agacaaccac atacagtaca tcaaaccago ctaccac^ac tacagcagca 30300 

gaggttgcca gotcgtctgg ggtccgagtg geatttttga tgttggcccc atctagcagt 30360 

cccactgcta gtaccaatga gcagactact gaatttttgt ccactgtcga gagccacacc 30420 

acagctacct ccagtgcctt ctctagcaco gccaatctct cctcgctttc ccctacacca 30480 

atcagccccg ctactactcc tagccccgct cctcttccca ctcccctgaa gcaaacagac 30540 

ggcggcatgc aatggcagat caccctgctc attgtgatcg ggttggtcat cctggccgtg 30600

ttgctctact acatcttctg ccgccgcatt cccaacgcgc accgcaagcc ggcctacaag 30660 

cccatcgtta tcgggcagcc ggagccgctt caggtggaag ggggtctaag gaatcttctc 30720 

ttctctttta cagtatggtg attgaactat gattcctaga caattcttga tcactkttct 30780 

tatctgcctc ctccaagtct gtgccaccct cgctctggtg gccaacgcca gtccagactg 30840

tattgggccc ttcgcctcct acgtgctctt tgccttcgtc acctgcatct gctgctgtag 30900 

catagtctgc ctgcttatca ccttcttcca gttcattgac tggatctttg tgcgcatcgc 30960 

ctacctgcgo caccaccccc agUccgcga ccagcgagtg gcgcagctgc tcaggctcct 31020 

ctgataagca cgcgggccet gctacttetc gogcttctgc tgttagtgct cccccgtccc 31080 

gtcgacccec ggtcccceac tcagtccccc gaggaggttc gcaaatgcaa attccaagaa 31140 
ccctggaaat tectcaaatg ctaccgccaa aaaecagaea tgcatcccag ctggktcatg 31200

atcattggga tcgtgaacat tctggcotgc accctcatct c6tttgtgat ttacccctgc 31260 

tttgaetttg gttggaactc gcqagaggeg ctetatctcc cgcctgaacc tgacacacca 31320 

cc^cagcagc aacctcaggc aca^cacta ccaccaccac agcctaggcc acaatacatg 31380 

cccatattag actatgaggc cgagccacag cgacccatgc tccccgctat tagttacctc 31440 

aatctaaccg gcggagatga ctgacccact ggccaataac aacgtcaacg accttctcct 31500 

ggacatggac ggccgcgcct cggagoagcg actcgcccaa ottcgcattc gtcagcagca 31560 

ggagagagcc gtcaaggagc tgcaggacgg catagccatc caccagtgca agagaggcat 31620 

cttctgcctg gtgaaacagg ccaagatctc ctacgaggtc acccagaccg accatcgcct 31680 

ctcctacgag ctcctgcagc agcgccagaa gttcacctgc otggtcggag tcaaccccat 31740 

cgtcatcacc cagcagtcgg gcgataccaa ggggtgcatc cactgctcet gcgactcccc 31800 

cgactgcgtc cacactctga tcaagaccct ctgcggcctc cgcgacctcc tccccatgaa 31860 

ctaatcaccc ccttatccag tgaaataaag atcatattga tgatgattta aataaaaaaa 31920 

ataatcattt gatttgaaat aaagatacaa tcatattgat gatttgagtt taacaaaaac 31980

aaagaatcac ttacttgaaa tctgatacca ggtctctgto catgttttct gccaacacca 32040 

cctcactccc ctcttcccag ctctggtact gcaggceccg gcgggctgca aacttJcctco 32100 

acacgctgaa ggggatgtca aatt«:tcct gtccctcaat cttcatttta tcttctatca 32160 

gatgtccaaa aagcgcgtcc gggtggatga tgacttcgac cccgtctacc cctaogatgc 32220 

agacaacgea ccgaccgtgc ccttcaxcaa cccccccttc gtctcttcag atggkttcca 32280 

agagaagccc ctgggggtgt tgtccctgeg actggctgac cccgtcacca ccaagaacgg 32340 

ggaaatcacc cCcaagetgg gagjigggggt ggacctcgac tcgtcgggaa aactcatctc 32400 

eaacaeggcc accaaggccg ccgcccctct cagtatttca aacaacacca tttcccttaa 32460 

aactgctgcc cctttctaca acaacaatgg aactttaagc ctcaatgtct ccacaccatt 32520 

ageagtattt cccacattta acactttagg cataagtctt ggaaacggtc ttcagacttc 32580 

aaataagttg ttgactgtac aactaactca tcctcttaca ttcagotcaa atagcatcac 32640 

agtaaaaaca gacaaagggc tatktattaa ctccagtgga aacagaggac ttga^getaa 32700 

tataagccta aaaagaggac tagtttttga cggtaatgct attgcaacat atattggaaa 32760 

tggcttagac tatggatctt atgatagtga tggaaaaaca agaeccgtaa t:taccaaaat: 32820 
tggagcagga ttaaattttg atgctaacaa agcaatagct gtcaaactag gcacaggttt 32880

aagttttgac tccgctggtg ccttgacagc tggaaacaaa caggatgaca agctaacact 32940 

ttggactacc cctgacccaa gccctaattg tcaattactt tcagacagag atgccaaatt 33000 

tactctctgt cttacaaaat gcggtagtca aatactaggc actgtggcag tggdggctgt 33060 

tactgtagga tcagcactaa atccaattaa tgacacagtc aiaaagcgcca tagttttcct 33120 

tagatttgat tccgatggtg tactcAtgtc aaactcatca atggtaggtg attactggaa 33180 

ctttagggag ggacagacca ctcaaagtgt agcctataca aatgctgtgg gattcatgcc 33240 

aaatataggt gcatatccaa aaacccaaag taaaacacct aaaaatagca tagtqagtca 33300 

ggtatattta actggagaaa cfcactatgcc aatgacacta accataactt tcaatggcac 33360 

tgatgaaaaa gacacaaccc cagttagcac ctactctatg acttttacat ggcagtggac 33420

tggagactat aaggacaaaa atattacctt tgctaccaac tcattctctt tttcctacat 33480 

cgcccaggaa taatcccacc cagcaagcca accccttttc ccaccacctt tgtctatatg 33540 

gaaactctga aacagaaaaa taaagtccaa gtgttttatt gaatcaacag ttttacagga 33600

ctcgagcagt tattttccct ccaccctccc aggacatgga atacaccacc ctctqccccc 33660 

gcacagcctt gaacatctga atgccattgg tgatggacat gcttttggtc tccacgttcc 33720

agacagtttc agagcgagcc agtctcggat cggtcaggga gatgaaaccc tccgggcact 33780 

cccgcatctg cacctcacag ctcaacagct gaggattgtc ctcggtggtc gggatcacgg 33840 

tt^tctggaa gaagcagaag agcggcggtg ggaatcatag tccgcgaacg ggatcggccg 33900 

gtggtgtcgc atcaggcccc gcagcagtcg ctgccgccgc cgctccgtca agctgctgct 33960 

cagggggttc gggtccaggg actccctcag catgatgccc acggccctca gcatcagtcg 34020 

tctggtgcgg cgggcgcagc agcgcatgcg aatctcgctc aggtcactgc agtacgtgca 34080 

acacaggacc accaggttgt tcaacagtcc atagttcaac acgctccagc cgaaactcat 34140 

cgcgggaagg atgctaccca cgtggccgtc gtaccagatc ctcaggtaaa tcaagtggcg 34200 

ctccctccag aagacgctgc ccatgtacat gatctccttg ggcatgtggc ggttcaccac 34260 

ctcccggtac cacatcaccc tctggttgaa catgcagccc cggatgatcc tgcggaacca 34320 

cagggccagc accgccccgc ccgccatgca gcgaagagac cccggatccc ggcaatgaca 34380 

atggaggacc caccgctcgt acccgtggat catctgggag ctgaacaagt ctatgttggc 34440 

acagcacagg catatgctca tgcatctctt cagcactctc agctcctcgg gggtcaaaac 34500 

catatcccag ggcacgggga actcttgcag gacagcgaac cccgcagaac agggcaatcc 34560 
tcgcacataa cttacattgt gcatggacag ggtatcgcaa tcaggcagca ccgggtgatc 34620

ctccaccaga gaagcgcggg tctcggtctc ctcacagcgt ggtaaggggg ccggccgata 34680 

cgggtgatgg cgggacgcgg ctgatcgbgt tctcgaccgt gt'catgatigc agttgctttc 34740 

ggacattttc gtacttgctg tagcagaacc tggtccgggc gctgcacacc ^atcgccggc 34800 

ggcggtctcg gcgcttggaa cgctcggtgt taaagttgta aaacagccac tdtctcagac 34860 

cgtgcagcag atctagggcc tcaggagtga tgaagatccc atcatgcctg atagctctga 34920 

tcacatcgac caccgtggaa tgggccaggc ccagccagat gatgcaattt tgttgggttt 34980

cggtgacggc gggggaggga agaacaggaa gaaccatgat taacttttaa tpcaaacggt 35040 

ctcggagcac ttcaaaatga aggtcacgga gatggcacct ctcgcccccg ctgtgttggt 35100 

ggaaaataac agccaggtca aaggtgatac ggttctcgag atgttccacg gtggcttcca 35160

gcaaagcctc cacgcgcaca tccagaaaca agacaatagc gaaagcggga gggttctcta 35220 

atccctcaac catcatgtta cactcctgca ccatccccag ataattttca tttttccagc 35280 

cttgaatgat tcgaactagt tcctgaggta aaticcaagcc agccaCgata aaaagctcgc 35340 

gcagagcacc ctccaccggc attcttaagc acaccctcat aattccaaga tattctgctc 35400

ctggttcacc tgcagcagat tgacaagcgg aatatcaaaa t:ctctgccgc gatccctgag 35460 

ctcctccctc agcaataact gtaagtactc tttcatatcg tctccgaaat ttbtagccat 35520 

aggaccccca ggaataagag aaggpcaagc cacattacag ataaaccgaa gtiCGCcccca 35580 

gtgagcattg ccaaatgtaa gattgaaata agcatgctgg ctagacccgg cgatatcttc 35640 

cagataactg gacagaaaat cgggtaagca atttttaaga aaatcaacaa aagaaaaatc 35700 

ttccaggbgc acgtttaggg cctcgggaac aacgat.ggag taagtgcaag gggtgcgttc 35760 

cagcatggtt agttagctga tctgtaaaaa aacaaaaaat aaaacattaa accatgctag 35820 

cctggcgaac aggtgggtaa atcgttctct ccagcaccag gcaggccacg gggtctccgg 35880 

cgcgaccctc gtaaaaattg tcgctatgat tgaaaaccat cacagagaga cgttcccggt 35940 

ggccggcgtg aatgattcga gaagaagcat acacccccgg aacattggag tccgtgagtg 36000 

aaaaaaagcg gccgaggaag caatgaggca ctacaacgct cactctcaag tccagcaaag 36060 
cgatgccatg cggatgaagc acaaaatttt caggtgcgta aaaaatgtaa ttactcccct 36120 
cctgcacagg cagcgaagct cccgatccct ccagatacac atacaaagcc tcagcgtcca 36180 
tagcttaccg agcggcagca gcagcggcac acaacaggcg caagagtcag agaaaagact 36240 
gagctctaac ctgtccgccc gctctctgct caatatatag ccccagatct acactgacgt 36300

aaaggccaaa gtctaaaaat acccgccaaa taatcacaca cgcccagcac acgcccagaa 36360 

accggtgaca cactcagaaa aatacgcgca cttcctcaaa cggccaaact gccgtcattt 36420 

ccgggttccc acgctacgcc atcaaaacac gactttcaaa ttccgtcgac cgttaaaaac 36480 

atcacccgcc ccgcccctaa cggtcgqcgc tcccgcagcc aatcaccttc ctccctcccc 36540 

aaattcaaac agctcatttg catattaacg cgcaccaaaa gtttgaggta tattattgat 36600 

gatg 36604

<210> 3 
<211> 36535 
<212> DNA 

<213> chimpanzee adenovirus serotype Pan7
<400> 3 

catcatcaat aatatacctc aaagttttgg tgcgcgttaa tatgcaaatg agctgtttga 60 

attcggggag ggaggaaggt gattggccga gagacgggcg accgttaggg gcggggcggg 120 

tgacgttttt aatacgtggc cgtgaggcgg agccggtttg caagttctcg tgggaaaagt 180

gacgtcaaac gaggtgtggt ttgaacacgg aafttactcaa ttttcccgcg ctctctgaca* 240 

ggaaatgagg tgtttctggg cggatgcaag tgaaaacggg ccattttcgc gcgaaaactg 300 

aatgaggaag tgaaaatctg agtaatttcg cgtttatggc agggaggagt attitgccgag* 360 

ggccgagtag actttgaccg attacgtggg ggtttcgatt accgtatttt tcacctaaat 420 

ttccgcgtac ggtgtcaaag tccggtgttt ttacgtaggc gtcagctgat cgccagggta 480

tttaaacctg cgctctctag tcaagaefgcc actcttgagt gccagcgagt agagttttct 540 

cctccgcgcc gcgagtcaga tctacacttt gaaagatgag gcacctgaga gacctgcccg 600 

gtaatgtttt cctggctact gggaacgaga ttctggaatt ggtggtggac gccatgatgg 660 

gtggcgaccc tcctgagccc cctaccccat ttgaggcgcc ttcgctgtac gatttgtatg 720 

atctggaggt ggatgtgccc gagaacgacc ccaacgagga ggcggtgaat gatttgttta 780

gcgatgccgc gctgctggct gccgagcagg ctaatacgga ctctggctca gacagcgatt 840 

cctctctcca taccccgaga cccggcctgag gtgagaaaaa gatccccgag ctt'aaagggg 900 

aagagctcga cctgcgctgc tatgaggaat gcttgcctcc gagcgatgat gaggaggacg 960 

aggaggcgat tcgagctgca tcgaaccagg gagtgaaagc tgcgggcgaa agctttagcc 1020 

tggactgtcc tactctgccc ggacacggct gtaagtcttg tgaatttcat cgcatgaata 1080 
ctggagataa gaatgtgatg tgtgccctgt gctatatgag agcttacaac cattgtgtbb 1140

acagtaagtg tgattaactt tagttgggaa ggcagagggt gactgggtgc tgactggbtt 1200

atttatgtat atgttttttt atgtgtaggt cccgtctctg acgtagatga gacccccact 1260

tcagagtgca tttcatcacc cccagaaatt ggcgaggaac cgcccgaaga tabtatbcat 1320

agaccagttg cagtgagagt caccgggcgg agagcagctg tagagaottt QOatioac fe t cr 1380

ctacagggtg gggatgaacc tttggccttg tgtacccgga aacgccccag Qcacbaaoba 1440

ccacacatgt gtgtttactt aa^gtgatgt cagtabttat agggtatgoa 1500

atccgtgttg actttaagtg cgtggtttat gactcagqciq toaqaacbcit •73 «l *mtX del A 1560

gcaggtgcag acctgtgtgg tcagttcaga gcaggactca tggaqatctia 1620

gaagactttc accagactag acagctgcta gagaactcat cggagggggt w w wet W M 1680

tggagattct gcttcggtgg gcctctagct aagctagtct atagggccaa 1740

aaggatcaat ttgaggatat tttgagagag tgtcctggta tttttgactc tctcaactta 1800

ggccatcagt ctcactttaa ccagagtact ctgagagccc ttgacttttc bactcctiggc 1860

agaactaccg ccgcggtagc cttttttgcc tttatccttg acaaatggag tcaagaaacc 1920

catttcagca gggattaccg tctggactgc ttagcagtag ctttgtggag aacatggagg 1980

tgccagcgcc tgaatgcaat ctccggctac ttgccagtac agccggtaga cacgc tgaaq 2040

atcctgagtc tccagtcacc ccaggaacac caacgccgcc agcagccgca gcaggaQcaa 2100

cagcaagagg aggaggagga tcgagaagag aacccgagag ccggtctgga cc c'fc ccggbg 2160

gcggaggagg aggagtagct gacttigtttc ccgagctgcg ccgggtgctg actaQQbcfct 2220

ccagtggacg ggagaggggg attaagcggg agaggcatga ggagacbagc cacaoaacfca 2280

aactgactgt cagtctgatg agccgcaggc gcccagaatc ggtgtggtoo ca tcraaa t r» 2340

agtcgcaggg gatagatgag gtctcggtga tgcatgagaa atattccctg 2400

agacttgttg gttggagcct gaggatgatt gggaggtagc catcaggaat baboccaaac 2460

tggctctgaa gccagacaag aagtaeaaga ttaccaaact gattaatatc aQaaabbccb 2520

gctacatttc agggaatggg gccgaggtgg agatcagtac ccaggagagg cibQCf ccb t ea 2580

gatgttgtat gatgaatatg taccccragact acy c ucacga 2640

acgcgaggtt caggggtgat ggsrtataatg gggtggtctt tatggccaac accaagcbga 2700

cagtgcacgg atgctccttc tttgggttica ataacatgtg catcgaggcc tggggcagtg 2760

tttcagtgag gggatgcagc ttttcagcca actggatggg ggtcgtgggc agaaccaaga 2820
gcaaggtgtc agtgaagaaa tgcctgfetcg agaggtgcca cctgggggtg atgagcgagg 2880

gcgaagccaa agtcaaacac tgcgcctcta ctgagacggg ctgctttgtg ctgatcaagg 2940 

gcaatgccca agtcaagcat aacatgatct gtggggcctc ggatgagcgc ggctaccaga 3000 

tgctgacctg cgccggtggg aacagccata tgctggccac cgtgcatgtg acctcgcacc 3060

cccgcaagac atggcccgag ttcgagcaca acgtcatgac ccgatgcaat gtgcacctgg 3120

ggtcccgccg aggcatgttc atgccctacc agtgcaacat gcaatttgtg aaggtgctgc 3180 

tggagcccga tgccatgtcc agagtgagcc tgacgggggt gtttgacatg aatgbggagc 3240 

tgtggaaaat tctgagatat gatgaatcca agaccaggtg ccgggcctgc gaatgcggag 3300 

gcaagcacgc caggcttcag cccgtgtgtg tggaggtgac ggaggacctg cgacccgatc 3360 

atttggtgtt gticctgcaac gggacggagt tcggctccag cggggaagaa tctgaxjtaga 3420 

gtgagtagtg tttgggggag gtggagggct tgtatgaggg gcagaatgac taaaatctgt 3480 

gtttttccgt gtgttgcagc agcatgagcg gaagcgcctc ctttgaggga ggggtattca 3540

gcccttatct gacggggcgt ctcccctcct gggcgggagt gcgtcagaat gtgatgggat 3600 

ccacggtgga cggccggccc gtgcagcccg cgaactcttc aaccctgacc tacgcgaccc 3660 

tgagctcctc gtccgtggac gcagctgcog ccgcagctgc tgcttccgcc gccagcgccg 3720 

tgcgcggaat ggccctgggc gccggctact acagctctct ggtggccaac tcgacttcca 3780 

ccaataatcc cgccagcctg aacgaggaga agctgctgct gctgatggcc cagctcgagg 3840 

ccctgaccca gcgcctgggc gagctgaccc agcaggtggc tcagctgcag gcggagacgc 3900 

gggccgcggt tgccacggtg aaaaccaaat aaaaaatgaa tcaataaata aacggagacg 3960 

gttgttgatt ttaacacaga gtctfcgaatc tttatttgat ttttcgcgcg cggtaggccc 4020 

tggaccaccg gtctcgatca ttgagcaccc ggtggatttt ttccaggacc cggtagaggt 4080 

gggcttggat gttgaggtac atgggcatga gcccgtcccg ggggtggagg tagctccatt 4140

gcagggcctd gtgctcgggg gtggtgttgt aaatcaccca gtcatagcag gggcgcaggg 4200 

cgtggtgctg cacgatgtcc ttgaggagga gactgatggc cacgggcagc cccttggtgt 4260

aggtgttgac gaacctgttg agctgggagg gatgcatgcg gggggagatg agatgcatct 4320 

tggcctggat cttgagattg gcgatgttcc cgcccagatc ccgccggggg ttcatgttgt 4380 

gcaggaccac cagcacggbg tatccggtgc acttggggaa tttgtcatgc aacttggaag 4440 

ggaaggcgtg aaagaatttg gagacgccct tgtgaccgcc caggttttcc atgcactcat 4500 
ccatgatgat ggcgatgggc ccgtgggcgg cggcctgggc aaagacgttt cgggggtcgg 4560

acacatcgta gttgtggtcc tgggtgagct cgtcataggc cattttaatg aatttggggc 4620 

ggagggtgcc cgactggggg acgaaggtgc cctcgatccc gggggcgtag ttgccctcgc 4680 

agatctgcat ctcccaggcc ttgagctcgg agggggggat catgtccacc tgcggggcga 4740 

tgaaaaaaac ggtttccggg gcgggggaga tgagctgggc cgaaagcagg ttccggagca 4800 

gctgggactt gccgcagccg gtggggccgt agatgacccc gatgaccggc tgcaggtggt 4860 

agttgaggga gagacagctg ccgtcctcgc ggaggagggg ggccacctcg ttcatcatct 4920 

cgcgcacatg catgttctcg cgcacgagtt ccgccaggag gcgctcgccc cccagcgaga 4980 

ggagctcttg cagcgaggcg aagtttttca gcggcttgag yccgtcggcc atgggcattt 5040 

tggagagggt ctgttgcaag agttccagac ggtcccagag ctcggtgatg tgctctaggg 5100

catctcgatc cagcagacct cctcgtttcg cgggttgggg cgactgcggg agtagggcac 5160 

caggcgatgg gcgtccagcg ag^ccagggt ccggtccttc cagggtcgca gggtccgcgt 5220 

cagcgtggtc tccgtcacgg tgaaggggtg cgcgccgggc tgggcgcttg cgagggtgcg 5280 

cttcaggctc atccggctgg tcgagaaccg ctcccggtcg gcgccctgcg cgtcggccag 5340 

gtagcaattg agcatgagtt cgtagttgag cgcctcggcc gcgtggccct tggcgcggag 5400

cttacctttg gaagtgtgtc cgcagacggg acagaggagg gacttgaggg cgtagagctt 5460 

gggggcgagg aagacggact cgggggcgta ggcgtccgcg ccgcagctgg cgcagacggt 5520 

ctcgcactcc acgagccagg tgsiggtcggg ccggttgggg tcaaaaacga ggtttcctcc 5580 

gtgctttttg atgcgtttct tacctctggt ctccatgagc tcgtgticccc gctgggtgac 5640 

aaagaggctg tccgtgtccc cgtaigaccga ctttatgggc cggtcctcga gcggggtgcc 5700 

gcggtcctcg tcgtagagga accccgccca ctccgagacg aaggcccggg tccaggccag 5760 

cacgaaggag gccacgtggg aggggtagcg gtcgttgtcc accagcgggt ccaccttctc 5820 

cagggtatgc aagcacatgt ccccctcgtc cacatccagg aaggtgattg gcttgtaagt 5880

gtaggccacg tgaccggggg tcccggccgg gggggtataa aagggggcgg gcccctgctc 5940 

gtcctcactg tcttccggat cgctgtccag gagcgccagc tgttggggta ggtattccct 6000 

ctcgaaggct ggcataacct cggcactcag gttgtcagtt tctagaaacg aggaggattt 6060 

gatattgacg gtgccgttgg agacgccttt catgagcccc tcgtccatct ggtcagaaaa 6120 

gacgatcttt ttgttgtcga gcttggtggc gaaggagccg tagagggcgt tggagaggag 6180 

cttggcgatg gagcgcatgg tctggttctt ttccttgtcg gcgcgctcct tggcggcgat 6240 
gttgagctgc acgtactcgc gcgcqacgca cttccattcg gggaagacgg tggtsagctc 6300

gtcgggcacg attctgaccc gccagccgcg gttgtgcagg gtgatgaggt ccacgctggt 6360 

ggccacctcg ccgcgcaggg gctcgttggt ccagcagagg cgcccgccct tgcgcgagca 6420 

gaaggggggc agcgggtcca gcatgagctc gtcggggggg tcggcgtcca cggtgaagat 6480 

gccgggcaga agctcggggt cgaagtagct gatgcaggtg tccagatcgt ccagcgccgc 6540 

ttgccagtcg cgcacggcca gcgcgcgctc gtaggggctg aggggcgtgc cccagggcat 6600 

ggggtgcgtg agcgcggagg cgtaqatgcc gcagatgtcg tagacgtaga ggggctcctc 6660 

gaggacgccg atgtaggtgg ggtagcagcg ccccccgcgg atgctggcgc gcacgtagtc 6720 

gtacagctcg tgcgagggcg egaggagccc cgtgccgagg ttggagcgtt gcggcttttc 6780 

ggcgcggtag acgatctggc ggaagatggc gtgggagttg gaggagatgg tgggcctctg 6840 

gaagatgttg aagtgggcgt ggggcaggcc gaccgagtcc ctgatgaagt gggcgtagga 6900 

gtcctgcagc ttggcgacga gctcggcggt gacgaggacg tccagggcgc agtagtcgag 6960 

ggtctcttgg atgatgtcgt acttgagctg gcccttctgc ttccacagct cgcggttgag 7020 

aaggaactct tcgcggtcct tccagtactc ttcgaggggg aacccgtcct gaticggcacg 7080 

gtaagagccc accatgtaga actggbtgac ggcctCgtag gcgcagcagc ccbtctccac 7140 

ggggagggcg taagcttgtg cggcctbgcg cagggaggtg tgggtgaggg cgaaggtgtc 7200 

gcgcaccatg accttgagga actggtgctt gaagtcgagg tcgtcgcagc cgccctgctc 7260 

ccagagctgg aagtccgtgc gcttcttgta ggcggggttg ggcaaagcga aagtaacatc 7320 

gttgaagagg atcttgcccg cgcggggcat gaagttgcga gtgatgcgga aaggctgggg 7380 

cacctcggcc cggttgttga tgacctgggc ggcgaggacg atctcgtcga agccgttgat 7440 

gttgtgcccg acgatgtaga gttccacgaa tcgcgggcgg cccttaacgt ggggcagctt 7500

cttgagctcg tcgtaggtga gctcggcggg gtcgctgagc ccgtgctgct cgagggccca 7560 

•gtcggcgacg tgggggttgg cgctgaggaa ggaagtccag agatccacgg ccagg^cggt 7620 

ctgcaagcgg tcccggtact gacggaactg ctggcccacg gccatttttt cgggggtgac 7680

gcagtagaag gtgcgggggt cgccgtgcca gcggtcccac ttgagctgga gggcgaggtc 7740 

gtgggcgagc tcgacgagcg gcgggt;cccc ggagagtttc atgaccagca tgaag^ggac 7800 

gagctgcttg ccgaaggacc ccatccaggt gtaggtttcc acatcgtagg tgaggaagag 7860 

cctttcggtg cgaggatgcg agccgatggg gaagaactgg atctcctgcc accagttgga 7920 
ggaatggctg ttgatgtgat ggaagtagaa abgccciacQa ^ywyecgagc acccgcgccc 7980

gtgtttatac aagcgtccgc agtgctcgca acQCtlocaca yyaugeacgb gcngcacgag 8040

ctgtacctgg gttcctttga cgaggaattt tgg^gcgctg gcggctgcat 8100

ctggtgctgt actacgtcct ggccaticggc a b acre c a r*a ccugccccga cggcggccati 8160

gctgacgagc ccgcgcggga ggcagg t cca Cia e 1 1* foar* t* cggacgggtc ggagagcgag 8220

gacgagggcg cgcaggccgg aocbati ccaa yy u WW tyaya cgctgcggag tcaggtcagt 8280

gggcagcggc ggcgcgcggt taac t tiacricr yoywbuC wCw agggcgcgcg ggaggtccag 8340

atggtacttg atctccacgg ggcga eg tc c acggcttgca gggtcccgtg 8400

cccctggggc gccaccacca •-y V* w u fc u ccccccgggc gctgcttcca tgccggtcag 8460

aagcggcggc gaggacgcgc Qc CGicia ccreifi ^aaggcggc c cgggacccgg aggcaggggc 8520

ggcaggggca cgtcggcgcc QCQCQcaaac wy y www wy y acbgcgcccg gagaagacbg 8580

gcgtgagcga cgacgcgacg Qtbaacabce *»yy ci WW wy A w gcctctgggt gaaggccacg 8640

ggacccgbga gbttgaacct w« Ct*J w twyacayaac caatctcggt atcgttgacg 8700

gcggcctgcc gcaQciatctc cccgagccgc cctggtaggc gatctcggtc 8760

atgaactgct cgatictccbc w w w w fc-y ClCt^jVj wCuwOyCggC cggcgcgctc gaicggtggcc 8820

gcgaggtcgt tggagatcica ~f WS»WCt Vv^jjBI^ W ^gcgagaagg cgttcatgcc ggcctcgttc 8880

cagacgcggc tgtagaccac 99 y cgcgcg cgcgcatgac cacctigggcg 8940

aggttgagct cgacgtggca w*j w^<im*jciww y wy bay w cgc agaggcgctg gtagaggtag 9000

^tigagcgtgg taQcaataho aagaagbaca tgatccagcg gcggagcggc 9060

atcbcgctga cgbcgcccag y y w w wwwcia^ cgccccacgg cctcgtagaa gtccacggcg 9120

aagttgaaaa actiacrcraaht acggbcaacU cctcctccag aagacggatg 9180

agctcagcQa tQQtQQcaca bcgaaggccc cggggggctc ctcttcttcc 9240

atCtlCttlCCt CC^dCAC^A^ wouCwCCuCu acttcctcct caggaggcgg cggcggggga 9300

agacggtcga tgaagcgctc gatggtctcc 9360

ccgcgccggc gacgcatigGrt C t CCia t" era r«r» gcgcgcccgc cctcgcgggg ccgcagcgtg 9420

aagacgccgc cgcgcatctc caggtiggccQ ccccgccggg cagggagagg 9480

gcgctgacga tgcatcttat caattggccc gtagggactc cgcgcaagga cctgagcgtc 9540

tcgagatcca cgggatccga aaaccgctga acgaaggctt cgagccagtc gcecgtcgcaa 9600

ggtaggctga gcccggtttc ttgttcttcg gggatttcgg gaggcgggcg ggcgatgctg 9660
ctggtgatga agttgaagta ggcggtcctg agacggcgga tggtggcgag gagcaccagg 9720

tccttgggco cggcttgctg gatgcgcaga cggtcggcca tgccccaggc gtggtcctga 9780

cacctggcga ggtccttgta gtagtcctgc atgagecgct ecacgggcac ctcctcctcg 9840 

occgcgcggo ogtgcatgcg cgtgagcecg aacccgcgct ggggctggac gagcgccagg 9900 

toggcgacga cgcgctcggc gaggatggcc tgctgtatct gggtgagggt ggtctggaag 9960 

tcgtcgaagt cgacgaagcg gtggtaggct coggtgttga tggtatagga gcagttggcc 10020

atgacggaco agttgacggt ctggtggcog ggtcgcacga gctcgtggta dttgaggcgc 10080 

gagtaggcgc gcgtgtogaa gatgtagtog ttgcaggtgc gcacgaggta ctggtatccg 10140 

acgaggaagt gcggcggcgg ctggcggtag agcggccatc gctcggtggc gggggcgccg 10200 

ggcgcgaggt cctcgagcat gaggcggtgg tagccgtaga tgtacctgga catbcaggtg 10260 

atgccggcgg eggtggtgga ggggcgcggg aactcgcgga cgcggttcca gatgttgcgc 10320 

agcggcagga agtagttcat ggtggccgcg gtctggcccg tgaggcgcgc gcagtcgtgg 10380 

atgcttctaga catacgggca aaaacgaaag cggtcagcgg ctcgactccg tggcctggag 10440 

gctaagcgaa cgggttgggc tgcgcgtgta ccccggttcg aatctcgaat cag^btggag 10500 

ccgcagctaa cgtggtaotg gcactcccgt ctcgacccaa gcctgctaac giaaccbcca 10560 

ggatacggag gcgggtcgtt tttt^cctt ggtcgetggt catgaaaaac ta^taagcgc 10620 

ggaaagcgac cgoccgcgat ggqtcgctgc cgtagtctgg agaaagaatc gcgagggttg 10680

cgttgcggtg tgccccggtt ogagcctcag cgctcggcgc cggccggatt ccgcagctaa 10740 

cgtgggcgtg gotgccccgt cgtttccaag accccttagc cagccgactt ctccagttac 10800

ggagcgagcc cctctttttc ttgtgttttt gocagatgca tcccgtactg cggcagatgc 10860 

gcccccaccc tccacctcaa ccgcccctac cgccgcagca gcagcaacag ccggcgcttc 10920 

tgcccccgcc ceagcagcag ccagccacta ccgcggcggc cgccgtgagc ggagccggcg 10980 

ttcagtatga cctggccttg gaagagggcg aggggctggc gcggctgggg gcgtcgtcgc 11040

cggagcg^ca cccgcgcgtg cagatgaaaa gggacgctcg cgaggcctac gtgcccaagc 11100

agaacctgtt cagagacagg agcggcgagg agcccgagga gatgcgcgcc tcccgcttcc 11160 

acgcggggcg ggagctgcgg cgcggcctgg accgaaagcg ggtgctgagg gacgaggatt 11220 

tcgaggcgga cgagctgacg gggatcagcc ccgcgcgcgc gcacgtggcc gcggfccaaec 11280 

tggtcacggc gtacgagcag accgtgaagg aggagagcaa cttccaaaaa tccttcaaca 11340
accacgtgcg cacgctgatc gcgcgcgagg aggtgaccct gggcctgatg cacctgtggg 11400
acctgctgga ggccatcgtg cagaacccca cgagcaagcc gctgacggcg cagctgtttc 11460 
tggtggtgca gcacagtcgg gacaacgaga cgttcaggga ggcgctgctg aatatcaccg 11520 
agcccgaggg ccgctggctc ctggacctgg tgaacattct gcagagcatc gtggtgcagg 11580
agcgcgggct gccgctgtcc gagaagctgg cggctatcaa cttctcggtg ctgagcctgg 11640 
gcaagtacta cgctaggaag atctacaaga ccccgtacgt gcccatagac aaggaggtga 11700 
agatcgacgg gttttacatg cgcatgaccc tgaaagtgct gaccctgagc gacgatctgg 11760 
gggtgtaccg caacgacagg atgcaccgcg cggtgagcgc cagccgccgg cgcgagctga 11820 
gcgaccagga gctgangcac agcctgcagc gggccctgac cggggccggg accgaggggg 11880 
agagctactt tgacatgggc gcggacctgc gctggcagcc cagccgccgg gcctt^aag 11940 
ctgccggcgg ttccccctac gtggaggagg tggacgatga ggaggaggag ggcgagtacc 12000 
tggaagactg atggcgcgac cgtatttttg ctagatgcag caacagccac cgcctcctga 12060 
Hcccgcgatg cgggcggcgc tgcagagcca gccgtccggc attaactcct cggacgattg 12120
gacccaggcG atgcaacgca tcatggcgct gacgacccgc aatcccgaag cctttagaca 12180 
gjcagcctcag gccaaccggc tctcggccat cctggaggcc gtggtgccct cgcgctcgaa 12240 
ccccacgcac gagaaggtgc tggccatcgt gaacgcgctg gtggagaaca aggccatccg 12300 
cggcgacgag gccgggctgg tgtacaacgc gctgctggag cgcgtggccc gctacaacag 12360 
caccaacgtg cagacgaacc tggaccgcat ggtgaccgac gtgcgcgagg cggtgtcgca 12420 
gcgcgagcgg ttccaccgcg agtcgaacct gggctccatg gtggcgctga acgccttcct 12480 
gagcacgcag cccgccaacg tgccccgggg ccaggaggac tacaccaact tcatcagcgc 12540 
gctgcggctg atggtggccg aggtgcccca gagcgaggtg taccagtcgg ggccggacta 12600 
cttcttccag accagtcgcc agggcttgca gaccgtgaac ctgagccagg ctttcaagaa 12660 
cttgcaggga ctgtggggcg tgca^cccc ggtcggggac cgcgcgacgg tgtcgagcct 12720 
gctgacgccg aactcgcgcc tgctgctgct gctggtggcg cccttcacgg acagcggcag 12780
cgtgagccgc gactcgtacc tgggctacct gcttaacctg taccgcgagg ccatcgggca 12840
ggcgcacgtg gacgagcaga cctaccagga gatcacccac gtgagccgcg cgctgg9cca 12900 
ggaggacccg ggcaacctgg aggccaccct gaacttcctg ctgaccaacc ggtcgcageta 12960 
gatcccgccc cagtacgcgc tgagcaccga ggaggagcgc atcctgcgct acgtgcagca 13020 
gagcgtgggg ctgttcctga tgcaggaggg ggccacgccc agcgccgcgc tcgacatgac 13080 

cgcgcgcaac atggagccca gcatgtacgc tcgcaaccgc ccgttcatca ataagctgat 13140 

ggactacttg catcgggcgg ccgccatgaa ctcggactac tttaccaacg ccatittgaa 13200 

cccgcactgg otcccgccgc cogggttcta cacgggcgag tacgacatgo ccgaccccaa 13260 

cgacgggttc ctgtgggacg acgtggacag cagcgtgtt^ tegcegcgcc cc^ccaccac 13320 

cgtgtggaag aaagagggcg gggaccggcg gccgtcctcg gcgctgtccg gtcgcgcggg 13380 

tgctgccgcg goggtgcctg aggccgccag ccccttcccg agcctgccct tttcgctgaa 13440 

cagcgtgcgc agcagcgagc tgggtcggct gacgcggccg cgcctgctgg gcgaggagga 13500 

gtaoctgaac gactccttgt tgaggcccga gcgcgagaag aacttcccca ataacgggat 13560

agagagcctg gtggacaaga tgagccgctg gaagacgtac gcgcapgagc acagggacga 13620 

gccccgagct agcagcagcg caggcacccg tagacgccag cgacacgaca ggcagcgggg 13680 

tctggtgtgg gacgatgagg attcogccga cgacagcagc gtgttggaot tg^gtgggag 13740 

tggtggtggt aacccgttcg etoa.cttgcg cccccgtatc gggcgcctga tgtaagaatc 13800 

tgaaaaaata akaaacggta ctcaccaagg ccatggcgac cagcgtgcgt tcttctctgt 13860

tgtttgtagt agtatgatga ggcgcgtgta cccggagggt cctcctccct cgtacgagag 13920

cgtgatgcag caggcggtgg cggcggcgat gcagcccccg ctggaggcgc cttaigtgcc 13980 

cccgcggtao ctggcgccta cggaggggcg gaacagcatt cgttactcgg agctggcacc 14040 

cttgtacgat accacccggt tgtacctggt ggacaacaag tcggcggaca tcgcctcgot 14100 

gaactaccag aacgaccaca gcaacttcct gaccaccgtg gtgcagaaca aegatttcac 14160 

ccccacggag gccagcaccc agaccatcaa ctttgacgag cgctcgcggt ggggcggcca 14220 

gctgaaaacc atcatgcaca ccaacatgcc caacgtgaac gagttcatgt acagcaacaa 14280 

gttcaaggcg cgggtgatgg tctdgcgcaa gacccccaat ggggtogcgg tggatgagaa 14340 

ttatgatggt agtcaggacg agctgactta cgagtgggtg gagtttgagc tgcccgaggg 14400 

caacttctcg gtgaccatga cqategatct gatgaacaac gccatcatcg acaactactt 14460 

Sgcggtgggg cgtcagaacg gggtqctgga gagcgacatc ggcgtgaagt tc^acacgcg 14520

caaottccgg ctgggctggg accbcgtgac cgagctggtg atgccgggcg tgtacaccaa 14580 

cgaggccttc caccccgaca tcgtcctgct gcccggctgc ggcgtggact tcacegagag 14640 

ccgcetcagc aacctgctgg gcatc<;gcaa gcggcagccc ttccaggagg gcttccagat 14700 

cctgtaegag gacctggagg ggggcaacat ccccgcgctc ttggatgtcg a^gcctatga 14760 
gaaaagcaag gaggaggccg ecacagcggc gaccgcagcc gtggccaccg cctctaccga 14820

ggtgcggggc gataattttg ctagfcgccgc ggcagtggcc gaggeggctg aaaccgaaag 14880 

taagatagte atccagccgg tggagaagga cagcaaggac aggagctaca acgtgctcgc 14940 

ggacaagaaa aacaccgcct accgcagctg gtacctggcc tacaactacg gcg^ccccga 15000

gaagggcgtg cgctcctgga cgctgcteac cacctpggao gtcacctgcg gcgcggagca 15060

agtctactgg tcgctgcccg acatgatgca agacccggtc accttcogot ccacgcgtca 15120 

agttagcaac tacccggtgg tgggcgccga gctcctgccc gtctactcca agagcttctt 15180 

caacgagcag gccgtctact cgcagcagct gcgcgccttc acctcgotca cgcacgtctt 15240 

caaccgcttc cccgagaacc agatcctcgt ccgcccgccc gcgcccacca ttaccacogt 15300

cagtgaaaac gttcctgctc tcacagatca cgggaccctg ccgctgcgca gcagtatccg 15360 

gggagtccag cgcgtgaccg tcaotgacgc cagacgccgc acctgcccct acgtctacaa 15420

ggccctgggc gtagtcgcgc cgcgcgtoct ctcgagccgc accttctaaa aaatgtccat 15480

tctcatctcg cccagtaata acaccggttg gggcctgcgc gcgcccagca agatgtacgg 15540 

aggcgctcgc caacgctcca cgoaacaccc cgtgcgcgtg cgcgggcact tccgcgctcc 15600 

ctggggcgcc cccaagggcc gcgtgcgctc gcgcaccacc gtcgacgacg tgatcgacca 15660 

9gfcggtggcc gaegcgcgca actacacgcc cgcegccgcg cccgcctcca ccgtggacgc 15720

cgtcatcgac agcgtggtgg ccgitgcgcg ccggcacgcc cgcgccaaga gccggcggcg 15780

gcgcategcc cggoggeacc ggagcacccc cgccatgcgc gcggcgcgag ccttgctgcg 15840 

cagggccagg cgcacgggac gcagggccat gctcagggcg gccagacgcg cggcctccgg 15900 

cagcagcagc gccggcagga cccgcagacg cgcggccacg gcggcggcgg cggccatcgc 15960 

cagcatgtcc cgcccgcggc gc^baacgt gtactgggtg cgcgacgccg cckccggcgt 16020 

gcgcgtgccc gtgcgcaccc gcccccctcg cacttgaaga tgctgacttc gcgatgttga 16080 

tgtgtcccag cggcgaggag gatgtccaag cgcaaataca aggaagagat gctccaggtc 16140 

atcgcgcctg agatctacgg ccccgcggtg aaggaggaaa gaaagccocg caaactgaag 16200 

ogggtcaaaa aggacaaaaa ggaggaggaa gatgtggacg gactggtgga gtttgtgcgc 16260 

gagttcgccc cccggcggog cgtgcagtgg cgcgggcgga aagtgaaacc ggtgctgcgg 16320 

cccggcacca cggtggtctt caigcccggc gagcgttccg gctccgcctc caagogctcc 16380 

tacgacgagg tgtacgggga cgaggacatc ctcgagcagg cggtcgagcg tctgggcgag 16440 

tttgcttacg gcaagcgcag ccgccccgcg cccttgaaag aggaggcggt gtccatcccg 16500 
ctggaccaccr gcaaccccac socgagcctg aagccggtga ccctgcagca 99t9ct9ccg 16560
agcgcggcgc cgcgccgggg cttcaagcgc gagggcggcg aggatctgta cccgaccatg 16620
cagctgatgg tgcccaagcg ccagaagctg gaggacgtgc tggagcacat 9aagg,ggac 16680
cccgaggtgc agcccgaggt caaggtgcgg cccatcaagc aggtggcccc gggcctgggc 16740
gtgcagaccg tggacatcaa gatccccacg gagcccatgg aaacgcagac cgagiiccgtg 16800
aagcccagca ccagcaccat ggaggtgcag acggatccct ggatgccggc gccggcttcc 16860
accactcgcc gaagacgcaa gtacggcgcg gccagectgc tgatgcccaa ctacgcgctg 16920
catccttcca tcatcccoac gccgggctac cgcggcacgc gcttctaccg cggctacacc 16980
agcagccgcc gcaagaccac cacccgccgc cgccgtcgtc gcacccgccg cagcagcacc 17040
gcgac«ecg ccgccgccct ggtgcggaga gtgtaccgca gogggcgcga gcctctgacc 17100
ctgccgcgcg cgcgctacca cccgagcatc gccatttaac tctgccgtcg cctcctactt 17160
gcagatatgg ccctcacatg ccgccj:ccgc gtccccatta cgggctaccg aggaigaaag 17220
ccgcgccgta gaaggctgac ggggaacggg ctgcgtcgcc atcaccaccg gcggcggcgc 17280
gccatcagca agcggttggg gggaggcttc ctgccogcgc tgatccccat catcgccgcg 17340
acgatcgggg cgatccccgg catagcttcc gtggcggtgc aggcctctca gcgccactga 17400
gacacagctt ggaaaatttg taataaaaaa atggactgac gctcctggtc ctgtgatgtg 17460
tgtttttaga tggaagacat caatttcccg tccctggcac cgcgacacgg cacgcggccg 17520
tttatgggca cctggagcga catcggcaac agccaactga acgggggcgc cttcaattgg 17580
agcagtctct ggagcgggct taagaatttc gggtccacgc tcaaaaccta tggcaacaag 17640
gcgtggaaea gcagcacagg gcaggcgctg agggaaaagc tgaaagagca gaacttccag 17700
cagaaggtgg Ccgatggcct ggcctcgggc atcaacgggg tggtggacct ggccaaccag 17760
gccgtgcaga aacagatcaa cagccgcctg gacgcggtce cgccogcggg gtccgtggag 17820
atgocccagg tggaggagga gctgcctccc ctggacaage gcggcgacaa gcgaccgcgt 17880
cccgaogogg aggagacgct gctgicgcac acggacgagc cgcccecgta cgaggaggcg 17940 
gtgaaactgg gtctgcccac cacgcggccc gtggcgcctc tggccaccgg ggtgctgaaa 18000 
cccagcagca gcagccagcc cgcgapcctg gacttgcctc cgcctgcttc ccgcccctcc 18060 
acagtggcta agcccctgcc gcc^gtggcc gtcgcgtcgc gcgccccccg aggccgcccc 18120 
caggcgaact ggcagagcac tctgadcagc atcgtgggtc tgggagtgca gagtgtgaag 18180
cgccgccgct gctattaaaa 9ac^<;tgtag cgcttaactt gcttgtctgt gtgtatatgt 18240

atgtccgccg accagaagga ggaa^aggcg ogtcgccgag ttgcaagatg gccaccccat 18300 

cgatgctgcc ccagtgggcg tacatgcaca tcgccggaca ggacgctccg gagtacctga 18360 

gtccgggtct ggtgcagttc gcccgcgcca cagacaccta cttcagtctg gggaacaagt 18420 

ttaggaaccc cacggtggcg cccacgcacg atgtgaccac cgaccgcagc cagcggctga 18480 

cgctgcgctt cgtgcccgtg gaccgcgagg acaacaccta ctcgtacaaa gtgcgctaca 18540 

cgctggccgt gggcgacaac cgcgtgctgg acatggccag cacctacttt gacatccgcg 18600 

gcgtgctgga tcgggggccc agcttcaaac cccactccgg caccgcctac aa.cagcctgg 18660 

ctcccaaggg agcgcccaac acttgccagt ggacatataa agctggtgat actgatacag 18720 

aaaaaaccta tacatatgga aatgbacctg tgcaaggcat tagcattaca aaggatggta 18780 

ttcaacttgg aactgacagc gatggtcagg caatctatgc agacgaaact tatcaaccag 18840 

agcctcaagt gggtgatgct gaatggcatg acatcactgg tactgatgaa aaatatggag 18900 

gcagagctct taagcctgac accaaaatga agccttgcta tggttctttt gecaagccta 18960 

ccaataaaga aggaggccag geaaatgtga aaaccgaaac aggcggtacc aaagaatatg 19020 

acategacat ggcattcttc gataatcgaa gtgcagctgc cgccggcota gccicagaaa 19080 

ttgttttgta tactgagaat gtggatctgg aaactccaga tacccatatt gtatacaagg 19140 

caggtacaga tgacagtago tcttct^tca atttgggtca gcagtccatg cccaacagac 19200 

ccaactacat tggcttcaga gacaacttta tcggtctgat gtactacaac agcactggca 19260 

atatgggtgt actggctgga caggcctccc agetgaatgc tgtggtggac ttgcaggaca 19320 

gaaacaccga actgtcctac cagctcttgc ttgactctct gggtgacaga accaggtatt 19380 

tcagtatgtg gaatcaggcg gtggacagtt atgaccccga tgtgcgcatt attgaaaatc 19440 

acggtgtgga ggatgaactt cctaactatt gcttccccct ggatgctgtg gg.tagaactg 19500 

atacttacoa gggaattaag gccaatggtg ataatcaaac cacctggacc aaagatgata 19560 

ctgttaatga tgctaatgaa ttgggcaagg gcaatccttt cgccatggag atcatfcatcc 19620 

aggcoaacct gtggcggaac ttcctctacg cgaacgtggc gctgtacctg cccgactcct 19680 

acaagtacac gccggccaac ateracgotge ccaccaacac caacacetac gattacatga 19740 

acggccgcgt ggtggcgccc tcgctggtgg acgcctacac caacatcggg gegcgctggt 19800 

cgctggaccc catggacaac gtcaacccct tcaaccacca ccgcaaegcg gg^cfegcgat 19860 

accgctccat gctcctgggc aacgggcgct acgtgccctt ccacatccag gtgccceaaa 19920 
agtttttcgc catcaagagc ctcctgctcc tgcccgggtc ctaeacctac gagtggaact 19980

tccgcaagga cgtcaacatg atpctgcaga gctccctcgg caacgacctg cgcacggacg 20040 

gggcctccat cgccttcacc agcatcaacc tctacgccac cttcttcccc atggcgcaca 20100 

acaccgcctc cacgctcgag gccatgctgc gcaacgacac caacgaccag tccttcaacg 20160

actacctctc ggcggccaac atgctctacc ccatcccggc caacgccacc aacgtgccca 20220 

tctccatccc ctcgcgcaac tgggccgcct tccgcggctg gtccttcacg cgcctcaaga 20280 

cccgcgagac gccctcgctc ggcbccgggt tcgaccccta cttcgtctac tcgggctcca 20340 

tcccctacct cgacggcacc ttctacctca accacacctt caagaaggtc tccatcacct 20400 

tcgactcctc cgtcagctgg cccggcaacg accgcctcct gacgcccaac gagttcgaaa 20460 

tcaagcgcac cgtcgacgga gaggggtaca acgtggccca gtgcaacacg accaaggact 20520 

ggttcctggt ccagatgctg gcccactaca acatcggcta ccagggcttc tacgtgcccg 20580 

agggccacaa ggaccgcatg tactccttct tccgcaactt ccagcccatg agc'cgccagg 20640 

tcgtggacga ggtcaactac aaggactacc aggccgtcac cctggcctac cagcacaaca 20700 

actcgggctt cgtcggctac ctcg'cgccca ccatgcgcca gggccagccc taccccgcca 20760 

actaccccta cccgctcatc ggcaagagcg ccgtcgccag cgtcacccag aaaaagttcc 20820 

tctgcgaccg ggtcatgtgg cgcatcccct tctccagcaa cttcatgtcc atgggcgcgc 20880

tcaccgacct cggccagaac atgctctacg ccaactccgc ccacgcgcta gacatgaatt 20940 

tcgaagtcga ccccatggat gagtccaccc ttctictatgt tgtcttcgaa gtcttcgacg 21000 

tcgtccgagt gcaccagccc caccgcggcg tcatcgaggc cgcctacctg cgcacgccct 21060 

tctcggccgg caacgccacc acctaagcct cttgcttctt gcaagatgac ggcctgcgcg 21120 

ggctccggcg agcaggagct cagggccatc ctccgcgacc tgggctgcgg gccctgcttc 21180 

ctgggcacct tcgacaagcg cttcccggga ttcatggccc cgcacaagct ggcctgcgcc 21240 

atcgtcaaca cggccggccg cgagaccggg ggcgagcact ggctggcctt cgcctggaac 21300 

ccgcgctccfc acacctgcta cqtcttcgac cccttcgggt tctcggacga gcgcctcaag 21360

cagatctacc agttqgagta cgagggcctg ctgcgtcgca gcgccctggc cacegaggac 21420 

cgctgcgtca ccctggaaaa gtccacccag accgtgcagg gtccgcgctc ggccgcctgc 21480 

gggctcttct gctgcatgtt cctgcacgcc ttcgtgcact ggcccgaccg ccccatggac 21540 

aagaacccca ccatgaactt gctgacgggg gtgcccaacg gcatgctcca gtcgccccag 21600 
Stsgaaccca ccctgcgccg caaceaggag gcgctctacc gcttcctcaa cgcccactcc 21660
9cctact«c gctcccaccg cgcgcgcatc gagaaggcca ccgccttcga ccgcatgaat 21720
caagacatgt aatccggtgt gtgtatgtga atgctttatt catcataata aackgcacat 21780
gt^atgcca ccttctctga ggctctgact ttatttagaa atcgaagggg ttctgecggc 21840
tctcggcatg gcccgcgggc agggatacgt tgcggaactg gtacttgggc agccacttga 21900
actcggggat cagcagcttc ggcacgggga ggtcggggaa cgagtcgetc cacagcttgc 21960
acgtgagttg cagggcgccc agcaggtcgg gcgcggagat cttgaaatcg ca^ttgggac 22020 
ccgcgttctg cgcgcgagag .cacggtaca cggggttgca gcactggaac accatcaggg 22080 
ccgggtgctt cacgctcgcc agcaccgtcg cgtcggtgat gccctccacg ttcagatcct 22140
cggcgttggc catcccgaag ggggtcatct tgcaggtctg ccgccccatg ctgggcacgc 22200 
agccgggctt gtggttgcaa tcgcagtgca gggggatcag catcatctgg gcetgctcgg 22260 
agctcatgcc cgggtacatg gccttcatga aagcctccag ctggcggaag gcctgctgcg 22320 
ccttgccgcc ctcggtgaag ^agaccccgc aggacttgct agagaactgg ttggtggcgc 22380 
agccagcgtc gcgcacgcag cagcgcgcgt cgttgttggc cagetgcacc acgctgcgcc 22440 
cccagcggtfctgggegatc ttggcccggt cggggttctc cttcagcgcg cgctgcccgt 22500 
tctcgctcgc cacatccatc tcgatcgtgc gctccttctg gatcatcacg gtcccgtgca 22560 
ggcaccgcag cttgccctcg gcptcggtgc acccgtgcag ccacagc^ cagccggtgc 22620 
tctcccagtt cttgtgggcg atcCgggagt gcgagtgcac gaagccctgc aggkagcggc 22680 
ccatcatcgt ggtcagggtc ttgttgctgg tgaaggtcag cggaacgccg cggtgctcct 22740 
cgttcacata caggtggcag atacggcggt acacctcgcc ctgctcgggc atcagctgga 22800 
aggcggactt eaggtcgetc tccacgcggt accggtccat cagcagcgtc atcacttcca 22860 
tgcccttctc ccaggccgaa acgatcggca ggctcagggg gttcttcacc gttgtcatct 22920 
tagtcgccgc cgccgaagtc agggggtcgt tctcgtccag ggtctcaaac actc^cttgc 22980 
cgtccttctc ggtgatgcgc acggggggaa agctgaagcc cacggccgcc kg^tcctcct 23040 
cggcctgcct ttcgtcctcg ctgtcetggc tgatgtcttg caaaggcaca tgc^tggtct 23100
tgcggggttt ctttttgggc ggcagaggcg gcggcggaga cgtgctgggc gag<^cgagt 23160
tctcgctcac cacgactatt tcttctcctt ggccgtcgtc cgagaccacg bggcggtagg 23220 
catgcctctt ctggggcaga ggcggaggcg acgggotctc gcggttcggo gggcggetgg 23280 
cagagcccct tccgcgttcg ggggtgcgct cctggcggcg ctgctctgac tgacttcctc 23340
cgcggccggc cattgtgttc tcctagggag caagcatigga gactcagcca tcgtcgccaa 23400
catcgccatc tgcccccgcc gccgccgacg agaaccagca gcagcagaat gaaagcttaa 23460
ccgccccgcc gcccagcccc acctccgacg ccgcagcccc agacatgcaa gagatggagg 23520 
aatccatcga gattgacctg ggctacgtga cgcccgcgga gcacgaggag gagctggcag 23580 
cgcgcttttc agccccggaa gagaaccacc aagagcagcc agagcaggaa gcagagagcg 23640 
agcagaacca ggctgggctc gagcatggcg actacctgag cggggcagag gacgtgctca 23700 
tcaagcatct ggcccgccaa tgcatcatcg tcaaggacgc gctgctcgac cgcgccgagg 23760 
tgcccctcag cgtggcggag ctcagccgcg cctacgagcg caacctcttc tcgccgcgcg 23820 
tgccccccaa gcgccagccc aacggcacct gcgagcccaa cccgcgcctc aacttccacc 23880 
cggtcttcgc ggtgcccgag gccctggcca cctaccacct ctttttcaag aaccaaagga 23940 
tccccgtctc ctgccgcgcc a^ccgcaccc gcgccgacgc cctgctcaac ctgggccccg 24000 
gcgcccgcct acctgatatc gcctrcpttgg aagaggttcc caagatcttc g^gggtctgg 24060 
gcagcgacga gactcgggcc gcgaacgctc tgcaaggaag cggagaggag catgagcacc 24120 
acagcgccct ggtggagttg gaaggcgaca acgcgcgcct ggcggtccCc aagcgcacgg 24180 
tcgagctgac ccacttcgcc tacccggcgc tcaacctgcc ccccaaggtc atgagcgccg 24240
tcatggacca ggtgctcatc aagcgcgcct cgcccctctc ggaggaggag atgcaggacc 24300 
ccgagagctc ggacgagggc aagcccgtgg tcagcgacga gcagctggcg cgctggctgg 24360 
gagcgagtag caccccccag agcctggaag agcggcgcaa gctcatgatg gccgtggtcc 24420 
tggtgaccgt ggagctggag tgtctgcgcc gcbtcttcgc cgacgcggag accctgcgca 24480 
aggtcgagga gaacctgcac tacctcttca gacacgggtt cgtgcgccag gcctgcaaga 24540 
tctccaacgt ggagctgacc aacctggtct cctacatggg catcct:gcac gagaaccgcc 24600 
tggggcagaa cgtgctgcac accaccctgc gcggggaggc ccgccgcgac tacatccgcg 24660 
actgcgtcta cctgtacctc tgccacacct ggcagacggg catgggcgtg tggcagcagt 24720
gcctggagga gcagaacctg aaagagctct gcaagctcct gcagaagaac ctcaaggccc 24780
tgtggaccgg gttcgacgag cgpacbaccg ccgcggacct ggccgaccM atcttccccg 24840 
agcgcctgcg gctgacgctg cgcaacgggc tgcccgactt tatgagccaa agcatgttgc 24900 
aaaactttcg ctctttcatc ctcgaacgct ccgggatcct gcccg6cacc tgctccgcgc 24960 
tgccctcgga cttogtgccg ctgaccttcc gcgagtgccc cccgccgctc tggagccact 25020 
gctacct9ct gcgcetggcc aaetacctgg cctaccactc ggacgtgatc gaggacgtca 25080

scggcgaggg cctgctcgag tgccactgcc gctgcaacct ctgcacgccg cac<ig<stccc 25140 

tggcctgcaa cccccagctg ctgagqgaga cccagatcat cggcaccttc gagttgcaag 25200 

gccccggcga gggcaagggg ggtctgaaac tcaccccggg gctgtggacc tcggcctact 25260 

tgegcaagtt cgtgcccgag gactaocatc cottcgagat caggttctac gaggaccaat 25320

cecagccgec caaggccgag ctgtcggcct gcgtcatcac ccagggggcc atcctggccc 25380 

aattgcaagc catccagaaa tcccgccaag aatttctgct gaaaaaggge capggggtct 25440 

acttggaccc ccagaccgga gaggagctca accccagctt cccccaggat gc'iccgagga 25500

agcagcaaga agctgaaagt ggagctgccg ccgccgccgg aggatttgga ggakgactgg 25560 

gagagcagtc aggcagagga ggaggagatg gaagactggg acagcactca ggca^aggag 25620 

gacagcctgc aagacagtct ggaggaggaa gaegaggtgg aggaggcaga ggaagaagca 25680 

gccgccgcca gaccgtcgtc ctcggcggag gaggagaaag eaagcagcac ggataccatc 25740 

tccgctccgg gtcggggtcg cggc^gccgg gcccacagta gatgggacga gaccgggegc 25800 

ttcccgaacc ccaccaccea gaccggtaag aaggageggc agggatacaa gtcctggcgg 25860 

sragcacaaaa acgccatcgt ctcctgcttg caagcctgcg ggggcaacat ctccttcacc 25920 

cggcgotacc tgcccttcca ccgcggggtg aacttccccc gcaacatctt gcattactac 25980 

cgtcacctcc acagccccta ctactgtttc caagaagagg cagaaaccea gcagcagcag 26040

cagcagcaga aaaccagcgg cagcagctag aaaatccaca gcggcggcag gtggactgag 26100 

gatcgcggcg aacgageegg cgcagacccg ggagctgagg aaccggatct ttcccaccct 26160 

ctatgccatc ttccagcaga gtcggg&gca agagcaggaa ctgaaagcca agaaccgttc 26220 

tctgcgctcg ctcacccgca gttgtctgta tcaoaagagc gaagaccaac ttcagcgcac 26280 

tctcgaggac gccgaggctc tottcaacaa gtactgcgcg ctcactctta aagagtagcc 26340 

cgcgcccgcc cacacacgga aaaaggcggg aattacgtca ccacctgegc ccttcgcccg 26400 

accatcatca tgagcaaaga gattcccacg ccttacatgt ggagctaoca gccccagatg 26460 

ggcctggccg ccggcgccgc ccaggactac tccacecgca tgaactggct cagtgccggg 26520 

cccgcgatga tctcacgggt gaatgacatc cgcgeccacc gaaaccagat actdctagaa 26580 

cagtcagcga tcaccgccac gcpccgccat caccttaatc cgcgtaattg gcccgccgcc 26640 

ctggtgtacc aggaaacccc ccagcecacg accgtactao ttcegcgaga cgcccaggce 26700 

gaagtccagc tgactaacte aggtgtccag ctggccggcg gcgccgccct gcgtcgtcac 26760 
C9cccc3ctc agggtataaa gcagctggtg atccgaggca gaggcacaca gctfcaacgac 26820

gaggtggtga gctcttcgct gggtctgcga cctgacggag tcttccaaet cgcc^atcg 26880 

gggagatctt ecttcacgcc tcgtcaggcc gtcctgactt tggagagttc gtcctcgcag 26940 

ccccgctcgg gtggcatcgg cactctccag ttcgtggagg agftcactcc ctcggtctac 27000 

ttcaacccct tctccggctc ccccggccac tacccggacg agttcatccc gaacttegac 27060 

gccatcagcg agtcggtgga cggctacgat tgaatgtocc atggtggcgc ggctgaccta 27120

gctcggcttc gacacctgga ccactgccgc cgcttccgct gcttcgcteg ggatctcgcc 27180

gagtttgcct actttgagct gcccgaggag caccctcagg gcccggccca cggagtgcgg 27240 

atcgtcgtcg aagggggtet cgaetcccac ccgcttcgga tcttcagoca gcgtccgacc 27300 

ctggccgagc gcgageaagg acagaccctt ctgaccctgt actgcatctg caaocacccc 27360 

ggcctgcatg aaagtctttg ttgCctgctg tgtactgagt ataataaaag ctgagatcag 27420 

cgactactcc ggacttccgt gtgttcctgo tatcaaccag tccctgttct tcaccgggaa 27480 

cgagaccgag ctccagctcc agtgtiagcc ccacaagaag tacctcacct ggctgttcca 27540 

gggctctccg atcgccgttg tcaaccactg cgacaacgac ggagtcctgc tgagcggccc 27600 

tgccaacctt actttttcca cccgcagaag caagctccag ctcttccaao ccttcctccc 27660 

cgggacctac cagtgcgtct cgggaccctg ccaccacacc ttccacctga tcccgaatac 27720 

cacagcgtcg ctccccgcta ctaScaacca aactacccac caacgccacc gtcgcgacct 27780 

ttcctctggg tctaatacca ctaccggagg tgagctccga ggtcgaccaa cctctgggat 27840

ttactacggc ccctgggagg tggtagggtt aatagcgcta ggcctagttg cgggtgggct 27900 

tttggctctc tgctacctat acctcccttg ctgttcgcac ttagtggtgc tgtgttgctg 27960 

gtttaagaaa tggggaagat caccctagtg agctgcggtg tgctggtggc ggtggtgctt 28020 

tcgattgtgg gactgggcgg cgcggctgta gtigaaggaga aggccgatcc ctgcttgcat 28080 

ttcaatcccg.acaaatgccW gctga^ttt cagcccgatg gcaatcggtg ogcggtgctg 28140 

atcaagtgeg gatgggaatg cgagaacgcg agaatcgagt acaataaeka gactcggaac 28200 

aataetctcg cgtccgtgtg gcagcceggg gaceccgagt ggtacaccgt ctctgtcccc 28260 

ggtgctgaeg getccccgcg caecgtgaat aatacttfcca tttttgcgca eatgtgcgac 28320 

acggtcatgt ggatgagcaa gcagtacgat atgtggcccc ceacgaagga gaacatcgtg 28380 

gtcttctcca tcgcttacag cgtgtgeacg gcgctaatca ccgctatcgt gtgcctgagc 28440 
attcacatgc tcatcgctat tcgccccaga aataatgccg aaaaagaaaa acagccataa 28500
cacgtttttt cacacacctt tttcagacca tggcctctgt taaatttttg cttttatttg 28560 
ccagtctcat tgccgtcatt catggaatga gtaatgagaa aattactatt tacactggca 28620 
ctaatcacac attgaaaggt ccagaaaaag ccacagaagt ttcatggtat tgttatttta 28680 
atgaatcaga tgtatctact gaactctgtg gaaacaataa caaaaaaaat gagkgcatta 28740 
ctctcatcaa gtttcaatgt ggatctgact taaccctaat taacatcact agagactatg 28800 
taggtatgta ttatggaact acagcaggca tttcggacat ggaattttat caagtttctg 28860 
tgtctgaacc caccacgcct agaatgacca caaccacaaa aactacacct gttaccacta 28920 
tacagctcac taccaatggc tttcttgcca tgcctcaagt ggctgaaaat agcaccagca 28980 
ttcaacccac cccacccagt gaggaaattc ccagatccat gattggcatt attgbtgctg 29040 
tagtggtgtg catgttgatc atcgccttgt gcatggtgta ctatgccttc tgctacagaa 29100 
agcacagact gaacgacaag ctggaacact tactaagtgt tgaattttaa ttttttagaa 29160 
ccatgaagat cctaggcctt ttagtttttt ctatcattac ctctgctcta tgcaattctg 29220 
acaatgagga cgttact:g.tc gttgtcggat caaattatac actaaaaggt ccagcaaaag 29280 
gtatgctttc gtggtattgt tggttcggaa ctgacgagca acagacagaa ctttgcaatg 29340 
ctcaaaaagg caaaacctica aattctaaaa tctctaatta tcaatgcaat ggcactgact 29400 
tagtattgct caatgtcacg aaagcacatg ccggcagtta cacctgccct ggagatgatg 29460 
ccgacaatat gattttttac aaagtggaag tggttgatcc cactactcca ccgcccacca 29520 
ccacaactac tcataccaca cacacagaac aaacaccaga ggcagcagaa gcagagttgg 29580 
ccttccaggt tcacggagat tcct'ttgctg tcaatacccc tacacccgat caigcggtgtc 29640 
cggggctgct cgtcagcggc attgtcggtg tgctttcggg attagcagtc ataatcatct 29700 
gcatgttcat ttttgcttgc tgctatagaa ggctttaccg acaaaaatca gacccactgc 29760 
tgaacctcta tgtttaattt tt'tccpigagc catgaaggca gttagcgctc tagttttttg 29820 
ttctttgatt ggcantgttt ttagtgctgg gtttttgaaa aatcttacca tttatgaagg 29880 
tgagaatgcc actctagtgg gcatcagtgg tcaaaatgtc agctggctaa aataccatct 29940 
agatgggtgg aaagacattt gcgattggaa tgtcactgtg tatacatgta atggagttaa 30000 
cctcaccatt: actaatgcca cccaagatca gaatggtagg t;ttaagggcc agagttccac 30060 
tagsiaataat gggtatgaat cccataacat gtttatctat gacgtcactg tcatcagaaa 30120 
tgagactgcc accaccacac agatgcccac tacacacagt tctaccacta ctaccatgca 30180 
aaccacacag acaaccacta catcaactca gcatatgacc accactacag cagc^agcc 30240
aagtagtgca gcgcctcagc cccaggcttt ggctttgaaa gctgcacaac ctagtacaac 30300 
tactaggacc aatgagcaga ctactgaatt tttgtccact gtogagagcc acaccacagc 30360
tacctccagt gccttctcta gcac<;3ccaa tctctcctcg ctttcctcta cacc^atcag 30420
Ccccgctact actcccaccc cagctcttct ccccactccc ctgaagcaaa ctgaggacag 30480 
cggcatgcaa tggcagatca ccctgctcat tgtgatcggg ttggtcatcc tggccgtgtt 30540 
gccctaetac atcttctgcc gccgcattcc caacgcgcac cgcaaaccgg cctacaagcc 30600 
catcgttatc gggcagecgg agcjcgcttca ggtggaaggg ggtctaagga a^cftcfcctt 30660 
ctcttttaca gtatggtgat tgaactatga ttcctagaca attcttgatc actattctta 30720 
tctgcctcct ccaagtctgu gccaccctcg ctctggtggc caacgccagt ccagactgta 30780 
ttgggccctt cgcctcctac gtgctctttg ccttcatcac ctgcatctgc tgctgtagca 30840 
tagtctgcct gcttatcacc ttcct:ccagt,tcattgactg gatctttgtg cgcatcgcct 30900 
acctgcgcca ccacccccag taccgcgacc agcgagtggc gcggctgctc aggctcctct 30960 
gataagcatg cgggctctgc tacttctcgc gctuctgctg ttagtgctcc cdcgccccgt 31020
cgacccccgg tcccccactc agtcccccga agaggtccgp aaatgcaaat tccaagaacc 31080 
ctggaaattc ctcaaatgct acogccaaaa atcagacatg cttcccagct ggatcatgat 31140 
cattgggatc gtgaacattc tggcctgcac cctcatctcc tttgtgattt acccctgctt 31200 
tgactttggt tggaactcgc cagaggcgct ctatctcccg cctgaacctg a«caccacc 31260 
acagcaacct caggcacacg cactaccacc accacagcct aggccacaat acatgcccat 31320 
attagactat gaggccgagc cabagcgacc catgctcocc gctattagtt attccaatct 31380 
aaccggcgga gatgactgac ccactggcca acaacaacgt caacgacctt ctcctggaca 31440
tggacggccg cgcctcggag cagcgactcg cccaacttcg cattcgccag cagcaggaga 31500 
gagccgtcaa ggagctgcag gaciggcatag ccatccacca gtgcaagaaa ggoatcttct 31560
gcctggtgaa acaggccaag atotcctacg aggtcacccc gaccgaccat cgcctetcct 31620 
acgagctcct gcagcagcgc cagaagttca cctgcctggt cggagtcaac cccatcgtca 31680
tcacecagca gtcgggcgat accaaggggt gcatccactg ctcctgcgac tcccccgact 31740
gcgtccacac tctgatcaag accctctgog gcctccgcga cctcctcccc atgaactaat 31800
caccccctta tccagtgaaa taaatktcat attgatgatg atttaaataa aaaataatca 31860 
tttgatttga aataaagata ca^tcatatt gatgatttga gttttaaaaa ataaagaatc 31920

acttacttga aatctgataa caggtctctg tccatgtttt ctgccaacac cacctcactc 31980 

ccctcttccc agctctggta ctgcagaccc cggcgggctg caaacttcct ccacacgctg 32040 

aaggggatgt caaattcctc ctgtccctca atcttcattt tatcttctaf cagatgtcca 32100 

aaaagcgcgt ccgggtggat gatgacttcg accccgtcta cccctacgat gcagacaacg 32160 

caccgaccgt gcccttcatc aaccccccct tcgtctcttc agatggattc caagagaagc 32220 

ccctgggggt gctgtccctg cgac.tggctg accccgtcae caccaagaac ggggaaatca 32280 

ccctcaagct gggagagggg gtggacctcg actcctcggg aaaactcacc tccaacacgg 32340 

ccaccaaggc cgccgcccct ctcagttttt ccaacaacac catttccctt aacatggata 32400 

cccctottta caccaaagat ggaaaattat ccttacaagt ttctccacog ttaaicatat 32460 

taaaatcaao cattctgaac acattagctg tagcttatgg atcaggttta ggactgagtg 32520 

gtggoactgo tcttgcagta cagct^cct ctccactcac ttttgatgaa aiag&aaata 32580

ttaaaattaa cctagccagt ggtcfcattaa cagttgatgc aagtcgactt a^tatcaaot 32640 

gcaaaagagg ggtoactgtc actacctcag gagatgcaat tgaaagcaac ataagctggc 32700 

ctaaaggtat aagatttgaa ggtaa^tggca tagctgcaaa cattggcaga ggattggaat 32760

ttggaaccac tagtacagag actgatgtca cagatgcata cccaattcaa gttaaattgg 32820 

gtactggcct tacctttgac agtacaggcg ccattgttgc ttggaacaaa gaggatgata 32880 

aacttacatt atggaccaca gccgacccct cgccaaattg caaaatatac tctgaaaaag 32940 

atgccaaact cacactttgc ttgacaaagt gtggaagtca aattctgggt actgtgactg 33000 

tattggcagt gaataatgga agtctcaacc caatcacaaa cacagtaagc actgcactcg 33060 

tctccctcaa gtttgatgca agtggagttt tgctaagcag ctccaoatta gacaaagaat 33120 

attggaactt cagaaaggga gatgttacac ctgctgagcc ctatactaat gctataggtt 33180 

ttatgcctaa cataaaggec tatcctaaaa acacatctgc agcttcaaaa agccatattg 33240 

tcagtoaagt ttatctcaat ggggatgagg ccaaaceact gatgctgatt att&ctttta 33300

atgaaactga ggatgcaact tgcacctaca gtatcacttt tcaatggaaa tgggatagta 33360 

ctaagtacac aggtgaaaca cttgptacca gctccttcae cttctcctac atcgcecaag 33420 

aatgaacact gtatcccacc ctgcatgcca acccttccca ecccactctg tetatggaaa 33480 

aaactctgaa gcacaaaata aaataaagtc caagtgtttt attgattcaa cagt'tttaca 33540

ggattcgagc agttattttt cctccaccct cceaggaeat ggaatacacc accctctccc 33600 
cccgcacagc cttgaacatc tgaatgccat tggtgatgga catgcttttg gtctccacgt 33660

tccacacagt ttcagagcga gccagtotcg ggtcggtcag ggagatgaaa ccctccggge 33720 

actcccgcat ctgcacctca c^gctiaaca gctgaggatt gtcetcggtg gtcgggatca 33780 

cggttafctg gaagaagcag aagagcggcg gtgggaatca tagtccgcga acgggatcgg 33840 

ccggtggtgt cgcatcaggc cccgcagcag tcgctgccgc cgccgctccg tcaagctgct 33900 

gctcaggggg tccgggtcca gggactccct cagcatgatg cccacggccc tcagcatcag 33960 

tcgtctggtg cggcgggcgc agcagogcat gcggatctcg ctcaggtcgc tgcagtacgt 34020 

gcaacacagg accaceaggt tgttdaacag tccatagtcc aacacgctcc agccgaaact 34080 

catcgcggga aggatgctac ccacgtggcc gtcgtaecag atcctcaggt aaatcaagtg 34140 

gcgctccctc cagaacacgc tgcccacgta catgatctcc ttgggcatgt ggcggttcac 34200 

cacctcccgg taecacatca ccetctggtt gaacatgcag ccccggatga tcctgcggaa 34260 

ccacagggcc agcaccgcco cgcccgccat gcagogaaga gaccccgggt cccggcaatg 34320 

gcaatggagg acccaccgct cgtacccgtg gatcatctgg gagctgaaca agtctatgtt 34380 

ggcacagcac aggcatatgc teatgcatct cttcagcact ctcagctcct cgggggtcaa 34440 

aaccatatcc cagggcacgg ggaactcttg caggacagcg aaccccgcag aacagggcaa 34500

tcctogcaca taacttacat tgtgcatgga cagggtatcg caatcaggca gcaccgggtg 34560

atcctccacc agagaagcgc g^gtbtcggc ctcctcacag cgtggtaagg gggccggccg 34620 

atacgggtga tggcgggacg cggctgatcg tgttcgcgac cgtgtcatga tgcagttgct 34680 

ttcggacatt ttcgtacttg ctgtagcaga acctggtccg ggcgctgcac accgategcc 34740 

ggcggcggtc ccggcgctcg gaacgctcgg tgttgaaatt gtaaaacagc cactctctca 34800 

gaccgtgcag cagatctagg gcctcaggag tgatgaagat cccatcatgc ctgatagctc 34860 

tgatcacatc gaccaccgtg gaatgggcca gaccoagcca gatgatgcaa ttttgttggg 34920 

tttcggtgac ggcgggggag g^aagaacag gaagaaccat gattaacttt taatccaaac 34980 

ggtctcggag caettcaaaa tgaaggtcgc ggagatggea octctcgccc ccgitgtgtt 35040 

ggtggaaaat aacagccagg tcaaaggtga tacggttctc gagatgttcc acggtggctt 35100 

ccagcaaagc otccacgcgc acatccagaa acaagacaat agcgaaagcg gga^ggttct 35160 

ctaattcctc aatcatcatg ttacactcet gcaccatecc cagataattt tcatttttcc 35220

agccttgaat gattcgaact agttcctgag gtaaatccaa gccagccatg atdaagagct 35280 
cgcgcagagc gccctccacc ggcattctta agcacaccct cataattcca agatattctg 35340

ctcctggttc acctgcagca gattgacaag cggaatatca aaatctctgc cgcgatccct 35400 

aagctcctcc ctcagcaata actgtaagta ctctttcata tcctctccga aatttttagc 35460 

cataggacca ccaggaataa gattagggca ag^ccacagta cagataaacc gaagtcctcc 35520 

ccagtgagca ttgccaaatg caagactgct ataagcatgc tggctagacc cggtgatatc 35580 

ttccagataa ctggacagaa aatcacccag gcaattttta agaaaatcaa caaaagaaaa 35640 

atcctccagg tgcacgttta gagcctcggg aacaacgatg aagtaaatgc aagcggtgcg 35700 

ttccagcatg gttagttagc tgatctgtaa aaaacaaaaa ataaaacatt aaaccatgct 35760 

agcctggcga acaggtgggt aaatcgttct ctccagcacc aggcaggcca cggggtctcc 35820 

ggcgcgaccc tcgtaaaaat tgtcgctatg attgaaaacc atcacagaga gacgttcccg 35880 

gtggccggcg tgaatgatnc gacaagatga atacaccccc ggaacattgg cgtccgcgag 35940 

tgaaaaaaag cgcccgagga agcaataagg cactacaatg ctcagtctca agtccagcaa 36000 

agcgatgcca tgcggatgaa gcacaaaatc ctcaggtgcg tacaaaatgt aattactccc 36060 

ctcctgcaca ggcagcgaag cc'cccgatcc ctccagatac acatacaaag cctcagcgtc 36120 

catagc'ttac cgagcagcag cacacaacag gcgcaagagt cagagaaagg ctgagctcta 36180 

acctgtccac ccgctctctg ctcaatatat agcccagatc tacactgacg taaaggccaa 36240 

agtctaaaaa tacccgccaa ataatcacac acgcccagca cacgcccaga aaccggtgac 36300 

acactcaaaa aaatacgcgc acttcctcaa acgcccaaac tgccgtcatt tccgggttcc 36360 

cacgctacgt catcggaatt cgactttcaa attccgtcga ccgttaaaaa cgtcacccgc 36420 

cccgccccta acggtcgccc gtctctcggc caatcacctt cctccctccc caaattcaaa 36480 

cagctcattt gcatactaac gcgcaccaaa agcttgaggt atattattga tgaitg 36535 

<210> 4 
<211> 34264 
<212> DNA 

<213> simian adenovirus SV-1
<400> 4 

tccttattct ggaaacgtgc caatatgata atgagcgggg aggagcgagg cggggccggg 60 
gtgacgtgcg gtgacgtggg gtgacgcggg gtggcgcgag ggcggggcgg gagtggggag 120 
gcgcttagtt tttacgtatg cggaaggagg ttttataccg gaagttgggt aatttgggcg 180 
tatacttgta agttttgtgt aatttggcgc gaaaaccggg taatgaggaa gttgaggtta 240 
atatgtactt tttatgactg ggcggaattt ccgctgatca gcagtgaact ttgggcgctg 300

acggggaggt ttcgctacgt ggcagtacca cgagaaggct caaaggtccc atttattgta 360 

ctcctcagcg ttttcgctgg gtatttaaac gctgtcagat catcaagagg ccactcttga 420 

gtgccggcga gtagagtttt ctcctccgcg ctgccgcgat gaggctggtt cccgagatgt 480 

acggtgtttt ctgcagcgag acggcccgga actcagatga gctgcttaat acaigatctgc 540 

tggatgttcc caactcgcct gtggcttcgc ctccgtcgct tcatgatctt ttcgatgtgg 600 

aagtggatcc accgcaagat cccaacgagg acgcggtaaa cagtiatgttc cccgaatgtc 660 

tgtttgaggc ggctgaggag ggttctcaca gcagtgaaga gagcagacgg ggagaggaac 720 

tggacttgaa atgctacgag gaatigtctgc c&tctagcga ttctgaaacg^gaacjsigacag 780 

ggggagacgg ctgtgagtcg gcaatgaaaa atgaacttgt attagactgt ccagaacatc 840 

ctggtcatgg ctgccgtgcc tgtgcttttc atagaaatgc cagcggaaat pctgagactc 900 

tatgtgctct gtgttatctg cgccttacca gcgattttgt atacagtaag taaagtgttt 960 

tcattggcgt acggtagggg attcgttgaa gtgctttgtg acttatt$itg tgtcattatt 1020

tctaggtgac gtgtccgacg tggaagggga aggagataga tcaggggctg ctaattctcc 1080 

ttgcactttg ggggctgtgg ttccagttgg catttttaaa ccgagtggtg gaggagaacg 1140 

agccggagga gaccgagaat ctgagagccg gcctggaccc tccagtggaa gactaggtgc 1200 

tgaggatgat cctgaagagg ggactagtgg gggtgctagg aaaaagcaaa aaactgagcc 1260 

tgaacctaga aactttttga atgagttgac tgtaagccta atgaatcggc agcgtcctga 1320 

gacggtgttt tggactgagt tggaggatga gttcaagaag ggggaattaa acctcttgta 1380 

caagtatggg tttgagcagt tgaaaactca ctggttggag ccgtgggagg atatggaaat 1440 

ggctctagac acctttgcta aagtggctct gcggccggat aaagtttaca ctattcgccg 1500 

cactgttaat ataaaaaaga gtgtttatgt tatcggccat ggagctctgg tgc^ggtgca 1560 

gaccccagac cgggtggctt tcaattgcgg catgcagagt ttgggccccg gggtgatagg 1620 

tl:tgaatgga gttacatttc aaaa^gtcag gtttactggt gatgatttta atggctctgt 1680 

gtttgtgact agcacccage taaccctcca cggtgtttac ttttttaact t^aacaatac 1740 

atgtgtggag tcatggggta gggtgtctct gaggggctgc agttttcatg gttgctggaa 1800 

ggcggtggtg ggaagaatta aaagtgtcat gtctgtgaag aaatgcatat ttgaacgctg 1860 

tgtgatagct ctagcagtag aggggtacgg acggatcagg aataacgccg catctgagaa 1920 

tggatgtttt cttttgccga aaggtacggc cagcgttaag cataatatga tttgcggcag 1980 
cggcctgtgc ccctcgcagc tctta^cttg cgcagacgga aactgtcaca cctigcgcae 2040

cgtgcacata gtgtcccact cgcgccgcac ctggccaaca tttgagcaca atatgetcat 2100 

gcgttgcgcc gttcacctag gtgctagacg cggcgtgttt atgccttatc iatgtaactt 2160 

tagtcatact aagattttgc tggaaactga ttccttcect cgagtatgtt tcaatggggt 2220 

gtttgacatg tcaatggaao tttttaaagt gataagatat gatgaaacca agtctcgttg 2280 

tcgetcatgt gaatgcggag ct^atcattt gaggttgtat cctgtaaccc tgaacgttac 2340 

cgaggagctg aggacggacc accacatgct gtcttgcctg cgtaccgact atgaatccag 2400 

cgatgaggag tgaggtgagg ggcggkgcca caaagggtat aaaggggcat gaggggtggg 2460

cgcggtgttt caaaatgagc gggacgacgg acggcaatgc gtttgagggg ggagtgctca 2520

gcccatatct gacatctcgt cttocttcct gggcaggagt tcgtoagaat gtagtgggct 2580 

ccaccgtgga cggacggccg gtcgcccctg caaattccgc caccctcacc t;atgccaccg 2640 

tgggatcatc gttggaeact gccgcggcag ctgccgcttc tgctgccgct tctactgctc 2700 

gcggcatggc ggctgatttt ggactatata accaactggc cactgcagct gtggcgtctc 2760 

ggtctctggt tcaagaagat gccctgaatg tgatcttfgac tcgcctggag atcatgtcac 2820 

gtcgcctgga cg&actggct gcgcagatat cocaagctaa ccccgatacc gcttcagaat 2880 

cttaaaataa agacaaacaa atttgttgaa aagtaaaatg gctttatttg ttttttttgg 2940 

ctcggtaggc tcgggtccac ctgtctcggt cgttaaggac tttgtgtatg ttttccaaaa 3000 

cacggtacag atgggcttgg afigttcaagt acatgggcat gaggccatct ttggggtgga 3060 

gataggacca ctgaagagcg tc&tgttccg gggtggtatt gcaaatcacc cagtcgtagc 3120 

agggtttttg agcgtggaac tggkatatgt ccttcaggag caggctaatg gccaagggta 3180 

gacccttagt gtaggtgctt acaakgcggt tgagctggga gggatgcacg cggggggaga 3240 

tgatatgcat cttggcttgg attttgaggt tagctatgtt accacccagg tctctgcggg 3300 

ggttcatgtt atgaaggacc accagcacgg tatagccagt gcatttgggg iacttgceat 3360 

gcagtttgga ggggaaggcg tggaagaatt tagatacccc cttgtgccic cctaggtttt 3420 

ccatgcaetc atecataata atggcaatgg gacccctggc ggccgcttta gcaaacacgt 3480

tttgggggtt ggaaacatca tagitttgct ctagagtgag ctcatcatag gccatcttta 3540

caaagegggg taggagggtg ccc^actggg ggatgatagt tccatctggg cctggagcgt 3600 

agttgccctc acagatctgc atctcccagg ccttaatttc cgaggggggg atcatgtcca 3660 
cctggggggc gataaaaaac aeggtttctg gcggggggtc aatgagctgg gtggaaagca 3720

agttacgcaa cagctgggat ttgccgcaac cggtgggaec gtagacgacc ccgatgaogg 3780 

gttgcagctg gtagttcaga gaggaacage tgccgtcggg gcgcaggagg ggagctacct 3840 

cattcatcat gcttctgaca tgtttatttt cactcactSa gttttgcaag kgcctctccc 3900 

caeccaggga taagagttct tcdaggctgt tgaagtgttt cagcggtttc aggccgtcgg 3960 

ccatgggcat cttttcaagc gactgacgaa gcaagtacag toggtccoag agbtoggtga 4020 

cgtgctctat ggaatctcga tccagcagac ttcttggttt cgggggttgg gbcgactttc 4080 

gctgtagggc accagccggt gggcgtccag ggccgcgagg gttctgtcct tccagggtct 4140 

cagcgttcgg gtgagggtgg tctcggtgac ggtgaaggga tgagccccgg gct^ggcgct 4200 

tgcgagggtg cgcttcaggc tcatcctgct ggtgctgaag cgggcgtcgt cticcctgtga 4260 

gtcggccaga tagcaacgaa gcatgaggtc gtagctgagg gactcggccg cgttgtccctt 4320 

ggcgcgcagc tttcccttgg aaacgtgctg aeatttggtg cagtgcagac actjrgagggc 4380 

gtagagtttt ggggccagga agaccgactc gggcgagtag gogtcggcte cgcactgagc 4440

gcagacggtc tcgcactcca ccagqcacgt gagctcgggt ttagcgggat <Saaaaaccaa 4500

gttgoctcfca ttttttttga tgtfgtttctt aecttgcgte tccatgagtc tgtgtcccgc 4560 

ttccgtgaca aaaaggctgt c^tatcccc gtagaccgac ttgagggggc gaccttccaa 4620 

aggtgttccg aggtcttccg cgtacaggaa ctgggaccao tccgagacaa aggctcgggt 4680 

ccaggctaac acgaaggagg cgatctgcga ggggtatctg tcgttctcaa t:gagggggtc 4740 

caccttttcc agggtgtgca gacacaggtc gtcctccCcc gcgtccacga aggtgattgg 4800 

ottgtaagtg taggtcacgt gaoccgcacc cccccaaggg gtataaaagg gggcgtgccc 4860

actctccccg tcactttctt ccgcatcgct gtggaccaga gccagctgtt cgggtgagta 4920 

ggccctctca aaagccggca tgatttcggc gctcaagttg tcagtttcta caaacgagge 4980 

ggatttgata ttcacgtgcc ccgcggcgat gettttgatg gtggaggggt ccatccgatc 5040 

agaaaacacg atctttttat tgtcaagttt ggtggcgaaa gacccgtaga gggcgttgga 5100

aageaacttg gcgatggagc gcagggtctg atttttctcc cgatcggccc tctccttggc 5160 

ggcgatgttg agttgcacgt acbcgcgggc eaegcaccgc cacccgggga acacggcggt 5220 

gcgctcgtcg ggcaggatgc gcacgcgcca gecgcggctg Cgcagggtga tgaggtccac 5280 

gotggtggcc acctccccgc ggaggggetc gttggtccaa eacaatcgce cccottttct 5340 

ggageagaac ggaggeaggg gat6tageaa gttggcgggc ggggggtcgg cgtcgatggt 5400 
aaatatgceg ggtagcagaa ttttattaaa ataatcgatt tcggtgtocg tgtcttgcaa 5460

cgcgtcttcc cacttcttca ccgccagggc cctttcgtag ggattcaggg gcggtcccea 5520 

gggcatgggg tgggtcaggg ccgaggcgta catgccgcag atgccgtaca cgtacagggg 5580

ctocctcaac accccgatgt aagtggggta acagcg<jccc ccgcggatgc tg^ctcgcac 5640 

gtagtcgtac atctcgtgag agggagccat gagcccgtct cccaagtggg tdtcgcgggg 5700

tttttcggcc cggeagagga tctgcctgaa gatggcgtgg gagttggaag agatagtggg 5760 

gcgttggaag acgttaaagt tggctccggg cagtcccacg gagtcttgga tgaactgggc 5820

gtaggattcc cggagcttgt ccaccagggc tgeggttacc agcacgtcga gagcgcagta 5880 

gtecaacgtc tcgcggacca ggttgtaggc cgtctcttgt tttttctccc acagttcgcg 5940 

attgaggagg tattcctcgc ggtctttcca gtactcttog gcgggaaatc ctttttcgtc 6000 

egctcggtaa gaacctaaca tgtaaaattc gttcacggct ttgtatggac aacagccttt 6060 

ttctaccggc agggcgtacg cttgagcggc ctttctgaga gaggtgtggg tgagggcgaa 6120 

ggtgtcccgc accateactt tcaggtactg atgtttgaag tccgtgtcgt cgcaggcgcc 6180 

ctgttcccac agcgtgaagt cggtgcgctt tttctgcctg ggattgggga gggcgaatgt 6240 

gacgtcgtta aagaggattt tcccggcgcg gggcatgaag btgcgagaga tcctgaaggg 6300 

tccgggcacg tccgagcggt cgttgatgao ttgcgccgcc aggacgatct cgtcgaagcc 6360

gttgatgttg tggcccacga tgtaaagttc gataaagcgc ggctgtccct tgagggccgg 6420 

cgcttttttc aactcctcgt aggtgagaca gtccggcgag gagagaecca gctccgcccg 6480

ggcccagtcg gagagctgag ggttagccgc gaggaaagag ctcoacaggt c^agggotag 6540 

cagagtttgc aagcggtcgc ggaactcgcg aaactttttc cccacggcca ttetctccgg 6600 

cgtcaccacg tagaaagtgc aggggcggtc gttccagacg tcccatcgga gctctagggc 6660 

cagctcgcag gcttgacgaa cgagggtctc otcgcccgag acgtgcatga c«agcatgaa 6720 

gggtaccaac tgtttcccga aegagcecat ccatgtgtag gtttctacgt cgtaggtgao 6780
aaagagecge tgggtgcgcg cgtgggagco gatcgggaag aagctgatcf cctgcekcca 6840

gttggaggaa tgggtgttga tgtggtgaaa gtagaagtcc cgccggcgca cag^cacte 6900 

gtgctgacgt ttgtaaaagc gaecgbagta gtcgeagcgc tgcacgctct gtatctcctg 6960 

aatgagatgc gcttttcgcc cgcgcaecag aaaccggagg gggaagttga gacgggggct 7020 

tggtggggcg gcatcccctt cgccttggcg gtgggagtct gcgtctgcgc cctccttcto 7080 



tgggtggacg acggtggaga cgacgacgcc ccgggtgccg caagtecaga tctocgccac 7140 

ggaggggogc aggcgttgca ggag^acg cagctgcccg ctgtccaggg agtcgagggc 7200 

ggccgcgctg aggtcggcgg gaapgtttg caagttcact ttcagaagac cggtaagagc 7260 

gtgagccagg tgcacatggt aettgatttc caggggggtg ttggaagagg cgtccacggc 7320

gtagaggagg ccgtgtccgc gcggggocac caccgtgccc cgaggaggtt ttatctcact 7380

cgtogagggc gagcgccggg gggtagaggc ggctctgogc cggggggcag cggaggcagt 7440 

ggcacgtttt cgtgaggatt cggcagcggt tgatgacgag occggagact gctggcgtgg 7500 

gcgacgacgc ggcggttgag gtcctg^atg tgccgtctct gcgtgaagac caceggcccc 7560 

cgggtcctga aoctgaaaga gagttccaca gaatcaatgt ctgcaccgtt aacggcggcc 7620 

tgcctgagga tctcctgtac gtcgcccgag ttgtcttgat aggcgatctc ggccatgaec 7680 

cgotccactt cttcctcgcg gaggtcgccg tggcccgctc gctccacggt ggcggccagg 7740 

tcgttggaga tgcgacgcat gagttgagag aaggcgttga ggccgttctc gttccacacg 7800 

oggctgtaca ccacgtctcc gaaggagtcg cgcgctcgca tgaccaoctg ggccacgttg 7860 

agttccacgt ggcgggcgaa ga^ggc^tag tttctgaggc gctggaagag gtagttgagc 7920 

gtggtggcga tgtgctcgca gacgaagaag tacatgatcc agcgccgcag ggtcatctcg 7980 

ttgatgtctc cgatggcttc gagacgctcc at^gcctcgc agaagtcgac ggcgaagttg 8040 

aaaaattggg agttgcgggc ggccaccgtg agttcttctt gcaggaggcg gatgagatcg 8100 

gcgaccgtgt cgcgcacctc ctgctcgaaa gcgccccgag gcgcctctgc ttcttcctcc 8160 

ggctcctcct cttccagggg cacgggttcc tccggcagct ctgcgacggg gacggggcgg 8220 

cgacgtcgtc gtctgaccgg caSgcggtcc acgaagcgct cgateatttc gccgcgccgg 8280 

cgaegcatgg tctcggtgac ggcgcgtccg ttttcgcgag gtcgcagttc gaagicgccg 8340 

ccgcgcagag cgcccccgtg cagggagggt aagtggttag ggccgtcggg cagggacacg 8400 

gcgctgacga tgcattttat caattgctgc gtaggcactc cgtgcaggga tctgagaacg 8460 
tcgaggtcga cgggatccga gaac^tctct aggaaagcgt ctatccaatc gcagtcgcaa 8520

ggtaagctga ggacggtggg ccgctggggg gcgtccgcgg gcagttggga ggtgatgctg 8580 

ctgatgatgt aattaaagta ggcggtcttc aggcggcgga tggtggcgag gaggaceacg 8640 

tctttgggcc cggcctgttg a^tgcgcagg cgctcggcca tgccccaggc cc<ictetga 8700 

cagcgacgca ggtctttgta gtagtcttgc ateagtctct ccaccggaac cectgcttct 8760 

cccctgtctg ccatgcgagt cgagccgaac ccccgcaggg gctgcagcaa cgctaggtcg 8820 
gccacgaccc tctcggccag cacggcctgt tggatctgcg tgagggtggt ctggaagtcg 8880

tccaggtcca cgaagcggtg ataggccccc gtgttgatgg tgtaggtgca gttggccatg 8940 

acggaccagt tgacgacttg catgccgggt tgggtgatct ccgtgtactt gaggcgcgag 9000 

taggcgcggg aetcgaacac gtagt<igtt3 catgtgcgta ccatfatactg gtagccaacc 9060

aggaagtggg gaggcggtte tcggtacagg ggccagccga ctgtggcggg ggcgccgggg 9120 

gacaggecgt ccagcatgag gcgatggtag tggtagatgt agcgggagag ccaggtgatg 9180

ccggccgagg tggtcgcggc cctggtgaat tcgcggacgc ggttccagat gttgcgcagg 9240 

gggcgaaagc gctccatggt gggcac^ctc tgccccgtga ggcgggcgca atcttgtacg 9300 

ctctagatgg aaaaaagaca gggcggteat cgactccctt ccgtagctcg gggggtaaag 9360 

tcgcaagggt gcggcggcgg ggaacdccgg ttcgagaccg gccggatccg ccgctpccga 9420 

tgcgcctggc cccgcatcca cgacgtccge gtcgagacec agccgcgacg ctccgcccca 9480 

atacggaggg gagtcttttg gtgttttttc gtagatgcat ccggtgctgc ggeagatgcg 9540 

acctcagacg cccaccacca ccgccgcggc ggcagtaaac ctgagcggag gcggtgacag 9600 

ggaggaggag gagctggctt tagacctgga agagggagag gggctggccc ggctgggagc 9660 
gccgtcccca gagagacacc ctagcfettca gctcgtgagg gacgccaggc aggottttgt 9720

gccgaagcag aacctgttta gg^accgcag cggucaggag gcggaggaga tgcgcgattg 9780 

caggtttcgg gcgggtagag agct^agggc gggcctcgat cgggagcggc tcctaagggc 9840 

ggaggatttc gagcccgacg agbgtitctgg ggtgagcecg gcccgcgctc acgtctcggc 9900 

ggccaacctg gtgagcgcgt acgagdagac ggtgaacgag gagogcaact tccaaaagag 9960 

ctttaacaat cacgtgagga ccpfgatcgc gagggaggag gtgaccatcg ggctgatgca 10020 

tctgtgggac ttcgtggagg cctacgtgca gaacccggoc agcaaacctc tgacggocca 10080

gctgttcctg atcgtgcagc ac'agccgcga caacgagacg ttcogcgacg ccatgttgaa 10140 

catcgcggag cccgagggtc gctggctctt ggatctgatt aacatcctgc agagcatcgt 10200

ggtgcaggag aggggcctca gittagcgga caaggtggcg gccattaact attcgatgca 10260 

gagcctgggg aagttctacg ctogbaagat ctacaagagc ccttaegtgc ccatkgacaa 10320 

ggaggtgaag atagacagct tttacatgcg catggcgctg aaggtgctga cgctgagcga 10380 

cgacctcggc gtgtaccgta acgacaagat ccacaaggcg gtgagcgcca gccgccggcg 10440 

ggagctgagc g^cagggagc tgatgcacag cctgcagagg gcgctggcgg gcgccgggga 10500 
ogaggagcgc gaggcttact tcgacatggg agccgatctg cagtggcgtc ccagcgcgcg 10560
cgccttggag gcggcggget accccgacga ggaggaccgg gacgatttgg aggaggcagg 10620 
cgagtacgag gacgaagcct gaccgggcag gtgttgtttt agatgcagcg gccggcggac 10680 
ggggccaccg cggatccegc acttttggca tccatgcaga gtcaaccttc gggcgtgacc 10740 

gcctccgatg actgggcggc ggccatggac cgcattatgg cgctgactac ccgcaacccc 10800 

gaggctttta gacagcaacc ccaggccaac cgtttttcgg ccatcttgga agcggtggtg 10860 

ccctcccgca ccaaccccac acacgagaaa gtcctgacta tcgtgaacgc cptg^tagac 10920 

agcaaggcca tccgccgcga cgaggcgggc ttgatttaca acgctctgct ggaacgggtg 10980 

gcgcgctaca acagcactaa cgttcagacc aacctggatc gcctcaccac cgacgtgaag 11040 

gaggcgctgg ctcagaagga gcggtttctg agggacagca atctgggotc tct'^tggca 11100

ctcaacgcct tcctgagcae gcagccggcc aaegtgccco gcgggcagga ggactacgtg 11160

agfcttcatca gcgctctgag gctgctggtg tccgaggtgc occagagoga ggtgtatcag 11220 

tctgggccgg attacttctt ccagacgtcc cgacagggct tgcaaaeggt gaacctgact 11280 

caggccttta aaaacttgca aggcatgtgg ggcgttaagg ccccggtggg cgatcgagcc 11340 

aocatctcca gtctgctgac ccccaacact cgcctgctgc tgctcttgat cgcgccgttc 11400 

aceaacagta gcactatcag ccgtgactcg tacctgggtc atctcat;cac tttgtaccgc 11460 

gaggccatcg gtcaggctca g^tcgacgag caeacatatc aggagatcac taacgtgagc 11520 

cgggccctgg gtcaggaaga taccggcagc ctggaagcca cgttgaactt tttg^taacc 11580

aaecggaggc aaaaaatacc ctcccagttt acgttaagcg ccgaggagga gaggattctg 11640

egatacgtgc agcagtccgt gagtctgtac ttgatgcggg agggcgccac bgcttccacg 11700 

gctttagaca tgacggctcg gaacatggaa ccgtcctttt actccgccca ccggccgttc 11760 

attaaccgtc tgatggacta cttccatcgc gcggcc^cca tgaacgggga gtacttcacc 11820 

aatgccatcc tgaatcogca tcggatgccc ccgtccggct tctacaccgg c^agtttgac 11880 

ctgcccgaag ccgacgacgg ctttctttgg gacgacgtgt ccgacagcat tttcacgccg 11940 

ggcaatogcc gattccagaa gaaggagggc ggagaogagc tccccctctc cagpgtggag 12000

goggcctcta ggggagagag tccctttccc agtctgtctt ccgccagcag feggtcgggta 12060 

acgcgcccgo ggttgcoggg ggagagcgac tacctgaacg accccttgct gc^gccggct 12120 

aggaagaaaa atttccccaa caacggggtg gaaagcttgg tggataaaat gaatcgttgg 12180

aagacctacg cccaggagca gcgggagtgg gaggaeagtc agccgcgacc gctggttccg 12240 
ccgcactggc gtcgtcagag agaag^cccg gacgactccg cagacgatag tagcgtgttg 12300

gaectgggag ggagcggagc oaaccccttt gctcacttge aacccaaggg gcgttccagt 12360 

cgcctctact aataaaaaag acgc^aac ttaccagagc catggccaca gcgtgtgtcc 12420

tttcttccte tctttcttcc tcggcg'cggc agaatgagaa gagcggtgag agioacgccg 12480 

gcggcgtatg agggtccgcc cccttcttac gaaagcgtga tgggatcagc gaacgtgccg 12540 

gccacgotgg aggcgectta cgttectccc agatacctgg gacctacgga gggcagaaac 12600 

agcatccgtt actcbgagct ggcacocctg tacgatacca ccaaggtgta cctgitggac 12660 

aacaagtogg cggacatcgc ctccetgaat tatcaaaacg atcacagcaa ttttctgact 12720 

accgtggtge agaacaatga cttcaceccg acggaggcgg gcacgcagac cattaacttt 12780 

gacgagcgtt cccgctgggg cggtcagctg aaaaccatcc tgcacaccaa catgcccaac 12840 

atcaacgagt tcatgtccao caaoaagtto aggsccaggc tgatggttaa aaaggccgaa 12900 

aaccagcctc ccgagtacga at^gtttgag ttcaccattc ccgagggcaa ctattccgag 12960

accatgacta tcgatctgat gaacaatgcg atcgtggaea attacctgca agtggggagg 13020

cagaacgggg tattggaaag cgatatcggc gtaaaatttg ataccagaaa cttccgactg 13080 

gggtgggato ccgtgaccaa gctggtgatg ccaggcgtgt acaccaacga ggctt'ttcac 13140 

coogacatcg tgctgctgcc ggggtgcggt gtggacttca ctcagagccg tttgagtaac 13200 

otgttaggga tcagaaagcg ccgccccttc caagagggct ttcagatoat gtatgaggac 13260 

ctggaaggag gtaacattcc aggtttgcta gacgtgccgg cgtatgaaga gagtgttaaa 13320 

caggcggagg cgcagggacg agagattcga ggegacacct ttgccacgga accfecaogaa 13380 

ctggcaataa aaoctctgga acaagacagt aaaaaacgga gttacaacat tatatccggc 13440 

actatgaata ccttgtaccg gagctggttt ctggcttaca actaegggga tcccgaaaag 13500 

ggagtgagat catggaccat actcaccacc acggacgtga cctgcggctc gcagcaagtg 13560 

tactggtcce tgccggatat gatgcaagac cdggtcacct tccgcccctc cacccaagto 13620 

agcaacttcc cggtggtggg caaigagetg ctgcccgtcc atgccaagag cttetaeaac 13680

gaacaggecg tctactcgca actcattcgc cagtocaccg cgcttaccca cgtgttcaat 13740 

cgctttcccg agaaccagat tctggtgcgc cctcccgctc ctaccattac ciccrgteagt 13800 

gaaaacgctc cegccctcac agatcacgga accctgccgc tgcgcagcag tatcagtgga 13860 

gttcagogcg tgaccatcac cgafcgccaga cgtcgaacct gtccctacgt ttacaaagct 13920 






cttggcgtag tggctcctaa agtgctctct agtcgcacct tctaaaeatg tceatcctca 13980 

tctctcccga caacaacacc ggctggggac tgggctccgg caagatgtac ggcggagcca 14040 

aaaggcgctc cagtcagcac ccagttcgag ttcggggcca cttccgtgct ccct^gggag 14100

cttacaagcg aggactctcg ggfccgaacgg cggtagacga taccatagat gccgtgattg 14160

ccgacgcccg ccg^tacaac cccggaccgg tcgctagcgc cgcctccacc gtggattccg 14220 

tgatcgacag cgtggtagct ggcgctcggg ccCatgctcg cc^caagagg cggctgcatc 14280 

ggagacgtcg ccccaccgcc gccatgctgg cagccagggc cgtgctgagg cgg^cccgga 14340 

gggtaggcag aagggctatg cgccgegctg ccgccaacgc cgccgccggg agggcccgce 14400 

gaeaggctgc ccgcoaggot gotgccgcca tcgctagcat ggccagaccc aggagaggga 14460 

acgtgtactg ggtgcgcgat tctgtgacgg gagtccgagt gccggtgcgc agecgacctc 14520

cccgaagtta gaagatccaa gctgcgaaga cggcggtact gagtctccct gttgktatca 14580

gcccaacatg agcaagcgca agtttaaaga agaactgctg cagacgctgg tgcctgagat 14640 

ctatggccct ccggacgtga agoctgacat taagccccgc gatatcaagc gtgttaaaaa 14700 

gcgggaaaag aaagaggaac tcgcggtggt agacgatggc ggagtggaat ttattaggag 14760 

tttcgccccg egacgcaggg ttcaatggaa agggcggcgg gtacaacgcg ttttgaggcc 14820 

gggcaccgcg gtagttttta ccccgiegaga goggtcggcc gttijggggtt tca^aggea 14880 

gtaogacgag gtgtacggcg acga^acat attggaacag gcggctcaac agatcggaga 14940 

atttgcctac ggaaagcgtt ogcgtcgcga agacctggcc atcgctttag adascggcaa 15000

ccccacgccc agcctcaaac ctgtgacgot gcagcaggtg ctccccgtga gcgccagcac 15060

ggacageaag aggggaataa aaagagaaat ggaagatctg cagcccacca tccigctoat 15120 

ggtccctaaa cggcagaggc tggaagaggt cctggagaaa atgaaagtgg acccaagcat 15180 

agagccggac gtcaaagtca ggccgatcaa agaagtggco octggtctcg gggtgcagac 15240 

ggtggatato cagatccccg tcacgtcagc ttcgacogcc gtggaagcca tggaaacgca 15300

aacggaaaco cctgcogcga tcggtaccag ggaagtggcg ttgcaaaccg acccctggta 15360 

cgaatacgcc gcocctcggc gtcagaggcg acccgctcgt tacggccccg ccaacgccat 15420 

catgccagaa tatgcgctgc atQcgtotat cctgcccacc cccggctacc ggg^agtgac 15480 

gtatcgcccg tcaggaaccc gccgcjcgaac ccgtcgccgc cgccgctccc gtcgtgctct 15540 

ggcocccgtg tcggtgcgcc gogtaacacg ccggggaaag acagttacca ttcccaaccc 15600 

gcgctaccac cctagcatcc tttaatgaet ctgccgtttt gcagatggct .ctgacttgcc 15660 






gcgtgcgcct tcccgttccg cactatcgag gaagatctcg tcgtaggaga ggcatggcgg 15720 
9t:agtggtcg ccggcgggct ttgcgcaggc gcatgaaagg cggaatttta cccgptctga 15780 
tacccataat cgccgcegcc atcggtgcca taeccggcgt cgcttcagtg gccttUaag 15840 
cagctcgtaa taaataaacg aaggcttttg cacttatgtc ctggtcctga ctattttatg 15900 
cagaaagagc atggaagaca tcaattttac gtcgctggct ccgcggcacg gctcgcggcc 15960 
gctcatgggc acctggaacg acatcggcac cagtcagctc aacgggggcg ctttcaattg 16020 
SSegagcctt tggagcggca ttaaaaactt tggctccacg attaaatcct acggcagcaa 16080
agcctggaac agtagtgctg gtcagatgct ccgagataaa ctgaaggaca ccaacttcca 16140 
agaaaaagtg gtcaatgggg tggtgaccgg catccacggc gcggtagatc tpgccaacca 16200 
agcggtgcag aaagagattg acaggcgttt ggaaagctcg cgggtgccgc cgcagagagg 16260 
ggatgaggtg gaggtcgagg aagtagaagt agaggaaaag ctgcccccgc tgg^gaaagt 16320 
tcccggtgcg cctccgagac cgcagaagcg acccaggcca gaactagaag aaaetctggt 16380 
gacggagagc aaggagcctc cctcgtacga goaagccttg aaagagggog cctctccacc 16440
ctacccaatg acaaaaccga tcgcgcctat ggctcggccg gtgtacggga aggactacaa 16500 
gcctgtcacg ctagagctcc- cc^ogccgcc accgccgccc cccacgcgcc cgkccgttcc 16560 
cccccccctg ccggctcegt cggcgggacd cgtgtccgca cccgtcgccg tgcctctgcc 16620 
agccgcccgc ceagtggccg tggc^ctgc cagaaaeccc agaggccaga gaggagccaa 16680
ctggcaaagc acgctgaaca gcatiogtggg cctgggagtg aaaagcctga aacgecgccg 16740 
ttgctafctat taaaagtgta gdtaaaaaaf ^tcccgttgt atacgcctcc ta.:gttaccg 16800 
ceagagacgc gtgactgtcg ccgcgagcgc cgctttcaag atggccaccc catcgatgat 16860 
gccgcagtgg tcttacatgc acatcgccgg gcaggacgcc tcggagtaec tgagccccgg 16920 
tctcgtgcag tccgcccgcg ccaocgacac ctacttcagc ttgggaaaca agtttagaaa 16980 
ccccaccgtg gcccccaccc acgatgtaac cacggaccgc tcgcaaaggc tgaccctgcg 17040 
ttttgtgccc gtagaccggg aggkcaccgc gtacfecttac aaagtgcgct acacgctggc 17100 
cgtaggggac aaccgagtgc tggacatggc cagcacctac tttgacatcc ggggagtgct 17160 
ggatcgcggt cccagcttta agccgtactc gggtaccgcg tacaattccc tggctcccaa 17220 
gggcgctccc aaccctgeag aatggacgaa ttcagacagc aaagttaaag tgagggcaoa 17280
ggcgcctttt gttagctcgt atggtgctac agcgattaca aaagagggta ttcaggtggg 17340 
agtaacctta acagactccg gatcaacacc acagtatgca gataaaacgt atcagcctga 17400

gccgcaaatt ggagaactac agtggaacag cgatgttgga accgatgaca aaatagcagg 17460 

aagagtgcta aagaaaacaa cgcccatgtt cccttgttac ggctcatatg ccaggcccac 17520 

taatgaaaaa ggaggacagg caacaccgtc cgctagtcaa gacgtgcaaa atcccgaatt 17580 

acaatttttt gcctctacta atgtcgccaa tacaccaaaa gcagttctat atgcggagga 17640 

cgtgtcaatt gaagcgccag acactcactt ggtgttcaaa ccaacagtca ctgaaggcat 17700 

tacaagttca gaggctctac tgacccaaca agctgctcec aaccgtccaa actacatagc 17760 

ctttagagat aattttattg gtctcatgta ctacaatagc acaggtaaca tgggagtact 17820 

ggcaggccag gcttctcagc taaatgcagt tgttgacctg caagacagaa atactgagct 17880 

gtcctaccaa ctcatgttgg acgcoctcgg agaccgcagt cggtactttt ctatgtggaa 17940 

ccaagctgtg gatagttacg atco^gatgt aagaatcata gaaaaccatg gcgtagaaga 18000 

tgaattgcct aattattgct ttcctttggg aggcatggea gtaaccgaca cctactcgcc 18060 

tataaaggtt aatggaggag gcaacggatg ggaagccaat aacggcgttt tcaccgaaag 18120 

aggagtggaa ataggttcag ggaacatgtt tgccatggag attaacetgo aagccaacct 18180 

atggogtagc tttctgtact. ecaatattgg gctgtacctg ccagactctc tcaaaatcac 18240 

tcctgacaac atcacactcc cagagkacaa aaaoacctat cagtatatga acggtcgcgt 18300 

gacgccaccc gggctggttg acaoctacgt taacgcgggc gcgcgctggt cccocgatgt 18360 

catggacagt attaaccctt ttaat.cacca ccgcaacgcc ggactccgct accgttccat 18420 

gctcetggga aacggaogct aeg,:gccctt ccacatccag gtgccccaga aattctttgc- 18480 

aattaaaaac ctgctgctgc tccccggttc ctacacctac gagtggaact tccgoaagga 18540 

cgtgaacatg atcttgcaga gctcgctggg caatgaoctg cgagtggacg gggecageat 18600 

ccgcttcgac agcatcaacc tgtacgccaa ctttttcccc atggcccaca acacggcctc 18660 

caccotggaa gccatgctgc gcaacgacac caacgaeeaa tctttcaacg actacctgtg 18720 

cgcggccaac atgctgtacc ccatccccgc caacgccacc agcgCgccca tct.ccattcc 18780 

ctctcgcaac tgggcagcct tcaggggctg gagtttcaec cgcctcaaaa ccaaggagac 18840 

cccctcgctg ggctccgggt tcgaccocca cttcgtctac tccggctcca t^ccctacct 18900 

ggacggcacc ttctacctca accatkettt caaaaaggtg tcaaCcatgt tcgactcetc 18960 

cgtcagctgg cccggcaacg accgtctgct gacgcecaac gagttcgaaa tcaagcgttc 19020 

39tggacggt gaagggtaca acgtggctca gagcaacatg aceaaggacc ^ttoctgat 19080 
tcagatgctc agccactaca acatcggota ccagggcttc tacgtgcceg aaaattacaa 19140

ggaccgcatg tactctttct tcagaaactt ccaacccatg agccgecaaa ttgtagatte 19200 

aacggcttac actaattatc aggatgtgaa actgceatac cageataaea actcagggtt 19260 

cgtgggctac-atgggaccca ccatgcgaga ggggcaggcc tacccggcca actafcccta 19320 

tcccctgatt ggggccaccg ccgtgeccag cctcacgcag aaaaagttoc tctgcgaccg 19380 

S9tgatgtgg aggatcccct tctfetagcaa cttcatgtct atgggctccc tcaccgacct 19440 

ggggcagaac atgctgtacg ccaa'ctecgc tcacgccttg gatatgacct ttgaggtgga 19500 
tcccatggat gagcccacgc ttctctatgt tctgtttgaa gtcttcgacg tggtgcgcat 19560

ccaccagccg eaecgcggcg tcatcgaggc cgcctacctg cgcacacctt tctctgccgg 19620 

taaogccacc acetaaagaa gccgatgggc tccagcgaac aggagctgca ggccattgtt 19680 

cgcgacctgg gctgcgggcc ctactttttg ggcaccttcg acaagcgttt tcccggcttc 19740 

atgtccccec acaagccggc otgcgccatc gctaacacgg ccggacggga gaccgggggg 19800

gtccactggc tcgccttcgc ctggaacccg cgtaaccgca cctgctacct gfetcgaccct 19860 

fcttggtttct ccgacgaaag gctgaagcag atctaccagt tcgagtacga ggggctcctc 19920 

aagcgcagcg ctctggcctc cacgcccgac cactgcgtca ccctggaaaa gtccacccaa 19980

acggtccagg ggcccctctc ggccgcctgc gggctcttct gctgcatgtt cttgcacgcc 20040 

ttcgtgcact ggcctcaoao ccccatggat cacaacccca ccatggatct gctcaccgga 20100

gtgcccaaca gcatgcttca cagccccckg gtcgccccca ccctgogccg taaccaggaa 20160 

cacctgtatc gotttctggg gaaapactct gcctattttc gccgccaccg gcagcgcatc 20220 

gaacgggcca cggccttcga aagcatgagc caaagagtgt aatcaataaa aaacattttt 20280 

atttgacatg ataogcgctt ctggcgtttc attaaaaatc gaagggttcg agggaggggt 20340 

cctcgtgccc gctggggagg gacacgttgc gatactggaa acgggcgctc 4a<^aaact 20400 

cggggatcac cagccgcggc aggggcacgt cctctaggtt ctgcttccaa aactgccgca- 20460 

ccagctgcag ggctcceatg ac&tcgggcg ccgatatott gaagtcgcag ttagggccgg 20520 

agctccogcg gctgttgcgg aacacggggt tggcacactg gaaeaecagc adgocggggt 20580 

tgtggatact ggceagggcc gtcgggtcgg tcacctccga cgcatccaga tcctcggcgt 20640 

tgctcaggge aaacggggtc agctttgcaca tctgccgccc aatccggggt actaggtcgo 20700 

gcttgttgag gcagtcgcag cgcagaggga tcaggatgcg tcgctgcccg cgttgcatga 20760 
tagggtaact cgccgccagg aactcctcca tttgacggaa ggceatctgg gctttgccgc 20820

cctcggtgta gaatagccc^ caggacttgc tagagaabac gttatgaccg cagttgacgt 20880 

cctccgcgca gcagcgggcg tcttcSSttct tcagctgaac cacgttgcgg ccccaacggt 20940

tctggaccac cttggctcta gtggggtgct ecttcagogc ccgctgtccg ttctcgctgg 21000 

ttacatceat ttccaacacg Cgctccttgc agaccatctc cactccgtgg aagcaataaca 21060 

ggacgccctc ctgctgggta ctgcgatgct cccatacggc gcatccggtg ggctdceagc 21120 

tcttgtgttt tacccccgcg taggcttcca tgtaagccat aaggaatctg cccatcagct 21180 

oggtgaaggt cttctggttg gtgaaggtta gcggcaggcc gcggtgctcc tcgttcaacc 21240 

aagtttgaca gatcttgcgg tacaccgctc cctggtcggg cagaaactta aaagccgctc 21300 

tgctgtcgtt gtctacgtgg aacttctcca ttaacatcat catggtttcc atacccttct 21360 

eccacgctgt caccagtggt ttgctgtcgg ggttcttcac caaoacggcg gtagaggggc 21420 

cctcgccggc cccgacgtcc ttdatggtca ttctttgaaa ctccacggag ccgtcogcgc 21480 

gacgtactct gcgcaccgga gggtiagctga agcccacctc caccacggtg ccttcgccct 21540 

cgctgtcgga gacaatctcc ggggatggcg gcggcgcggg tgtcgecttg ogigccttct 21600 

tcttgggagg gagctgaggo gcctcitgct cgcgctcggg gctcatctcc cgcaagtagg 21660 

gggtaatgga gctgcctgct tggttctgae. ggttggccat- tgtatcotag gcagaaagac 21720 

atggagctta tgcgcgagga aactttaace gccccgtccc ccgtcagcga cgaagatgta- 21780 

atcgtcgaac aggacccggg ct&qgttacg ccgcccgagg atctggaggg gcctgaccgg 21840 

cgcgacgcta gtgagcggca ggaaaatgag aaagaggagg cctgctacct cctggaaggc 21900 

gacgttttgc taaagcattt cgcca^cag agcaccatag ttaaggaggc cctgcaagac 21960 

cgctccgagg tgcccttgga cgtcgccgog ctctcccagg cctacgaggc gaaccttttc 22020 

togoctogag tgcetccgaa gagacagccc aacggcacct gcgagcccaa cccgcgactc 22080 

aacttctacc ccgtgttcgc cgtac6agag gcgctg^cca cctatoacat tt:t:j:ttcaaa 22140 

aacoaacgca tccccctatc gtgicgggcc aaccgcaccg cggccgatag gaatctcagg 22200

cttaaaaacg gagccaacat acctgatatc acgtcgctgg aggaagtgcc c^agattttc 22260 

gagggtctgg gtcgagatga gaagcgggcg gcgaacgctc tgcagaaaga acagaaagag 22320

agtcagaacg tgctggtgga gctggagggg gacaacgcgc gtctggccgt cctcaaacgc 22380 

tgcatagaag tctcccactt cgbctacccc gccctcaact tgccacccaa agttatgaaa 22440 

tcggtcatgg atcagctgct catcaagaga gctgagcccc tggatcccga ccaccccgag 22500
gcggaaaact cagaggacgg aaagcccgtc gtcagcgacg aggagctcga gcggtggctg 22560

gaaaccaggg acccccaaca gttgcaagag aggcgcaaga tgatgatggc ggccgtgctg 22620 

gtcaccgtgg agctggaatg cctgcaacgg tttttcagcg acgtggagac gctacgcaaa 22680 

atcggggaat ccctgcacta cacct^ccgc cagggctacg tccgccaggc ctgcaagatc 22740 

tccaacgtgg agctcagcaa cbtggtctcc tacatgggca tcctccacga gaaccggctg 22800 

gggcagagcg tgctgcactg caccttgcaa ggcgaggcgc ggcgggacta cgtgcgagac 22860 

tgcatctacc tcttcctcac cctcacctgg cagaccgcca tgggcgtctg gcagcagtgc 22920 

ttggaagaga gaaacctcaa agagctagac aaactcctct gccgccagcg gcgcgccctg 22980 

tggtccggtt tcagcgagcg cacggtcgcc agcgctctgg cggacatcat cttcccggag 23040 

cgcctgatga aaaccttgca aaacggcctg ccggatttca tcagtcaaag cattttgcaa 23100

aacttccgct cttttgtcct ggaacgctcc gggatcttgc ccgccatgag ctgcgcgcta 23160 

ccttctgact ttg.tccccct ctpcpaccgc gagtgccctc ccccactgtg gagccactgc 23220 

tacctcttcc aactggccaa cttti:Gtggcc taccactccg acctcatgga agacgtaagc 23280 

ggagagggtt tactggagtg ccactgccgc tgcaacctgt gcacccccca cagatcgctg 23340 

gcctgcaaca ccgagctact cagcgaaacc caggtcatag gtaccttcga gatcdagggg 23400 

ccccagcagc aagagggtgc ttccggcttg aagctcactc cggcgctgtg gacctcggct 23460 

tacttacgca aatttgtagc cgaggactac cacgcccaca aaattcagtt ttacgaagac 23520 

caatctcgac caccgaaagc ccccctcacg gcctgcgtca tcacccagag caagatcctg 23580 

gcccaattgc aatccatcaa ccaagcgcgc cgcgatttcc tcttgaaaaa ggg^cggggg 23640 

gtgtacctgg acccccagac cggcgdggaa ctcaacccgt ccacactcto cgtcgaagca 23700 

gcccccccga gacatgccgc ccaagggaac cgccaagcag ctgatcgctc ggcagagagc 23760 

gaagaagcaa gagctgctcc agcagcaggt ggaggacgag gaagagatgt gggacagcca 23820 

ggcagaggag gtgtcagagg acgaggagga gatggaaagc tgggacagcc tagacgagga 23880

SSaggacgag ctttcagagg aagaggcgac cgaagaaaaa ccacctgcat ccag.cgcgcc 23940 

ttctctgagc cgacagccga agcccc^gcc cccgacgccc ccggccggct cactcaaagc 24000 

cagccgtagg tgggacgcca ccgaatctcc agcggcagcg gcaacggcag cgggtaaggc 24060 

caaacgcgag cggcgggggt attgctcctg gcgggcccac aaaagcagta ttgtgaactg 24120 

cttgcaacac tgcgggggaa acatctcctt tgcccgacgc tacctcctct tcca.tcacgg 24180 
tgtggccttc cctcgcaacg ttctctatta ttaccgtcat etctacagcc cctacgaaac 24240
gctcggagaa aaaagctaag gcctcQtccg ccgcgaggaa aaactccgcc gccgctgccg 24300 
ccgccaagga tccaccggcc accgaagagc tgagaaagcg catctttccc actctgtatg 24360 
ctatctttca gcaaagccgc gggcagcacc ctcagcgcga actgaaaata aaaaaccgct 24420 
ccttccgctc gctcacccgc agctgtctgt accacaagag agaagaccag ctgcagcgca 24480 
ccctggacga cgccgaagca ctgttcagca aatactgctc agcgtctctt aaagactaaa 24540 
agacccgcgc tttttccccc tcggcc^cca aaacccacgt catcgccagc atgagcaagg 24600 
agattcccac cccctacatg tggagctatc agccccagat gggcctggcc gcgggggccg 24660 
cccaggacta ctccagcaag atgaactggc tcagcgccgg cccccacatg atctcacgag 24720 
ttaacggcat ccgagcccac cgaaaccaga ttctcttaga acaggcggca atcaccgcca 24780 
caccccggcg ccaactcaac ccgcctagtt ggcccgccgc ccaggtgtat caggaaaatc 24840 
cccgcccgac cacagtcctc ctgdcacgcg acgcggaggc cgaagtcctc atgactaact 24900 
ctggggtaca attagcgggc gggtccaggt acgccaggta cagaggtcgg gccgctcctt 24960 
actctcccgg gagtataaag agggtgatca ttcgaggccg aggtatccag ctcaacgacg 25020 
agacggtgag ctcctcaacc ggtctcagac ctgacggagt cttccagctc ggaggagcgg 25080 
gccgctcttc cttcaccact cgc.caggcct acctgaccct gcagagctct tcctcgcagc 25140 
cgcgctccgg gggaatcggc actctccagt tcgtggaaga gttcgttccc tccgtctacf 25200 
tcaacccctt ctccggctcg cctggacgct acccggacgc cttcattccc aact'ttgacg 25260 
cagtgagtga atccgtggac ggctacgact gatgacagat ggtgcggccg tgagagctcg 25320 
gctgcgacat ctgcatcact gccgtcagcc tcgctgctac gctcgggagg cgatcgtctt 25380 
cagctacttt gagctgccgg acgagcaccc tcagggtccg gctcacgggt tgaas^ctcga 25440 
gatcgagaac gcgctcgagt ctcgcctcat cgacaccttc accgcccgac ctctcctggt 25500 
agaaatccaa cgggggatca ctaccatcac cctgttctgc atctgcccca cgcccggatt 25560 
acatgaagat ctgtgttgtc atctttgcgc tcagtttaat aaaaactgaa ctttctgccg 25620
cacctbcaac gccatctgtg atttctacaa caaaaagttc ttctggcaaa ggcacacaaa 25680 
ctgtatttta ttctaattct acptcatcta tcgtgctgaa ctgcgcctgc actaacgaac 25740 
ttatccagtg gattgcaaac ggtagtgtgt gcaagtactt ttgggggaac gatatagtta 25800 
gtagaaataa cagcctttgc gagcactgca actcctccac actaatcctt tat^ccccat: 25860 
ttgttactgg atggtatatg tgcgttggct ccggtttaaa tcctagttgc tttcataagt 25920 
ggtttctaca aaaagagacc cttcccaaca attctgtttc ttttttcgcc ctatcctact 25980

gctgttctcc ctctggttac tctttcaaac ctctaattgg tattttagct ttgatactca 26040 

taatctttat taactttata ataattaaca acttacagta aacatgcttg ttctactgct 26100 

cgccacatct ttcgctctct ctcacgccag aacaagtatt gteggcgcag gttacaatgc 26160 

aactcttcaa tctgcttaca tgccaigattc cgaccagata ccccatatta cgtggtactt 26220 

acaaacctcc aaacctaatt cttcatttta tgaaggaaac aaactctgcg artgactccga 26280 

caacagaacg cacacatttc cccacccttc actacaattc gaatgcgtaa acaaaagctt 26340 

gaagctttac aacttaaagc cttcagattc tggcttgtac catgctgtag ttgaaaaaag 26400 

taatttagaa gtccacagtg attacattga attgacggtt gtggacctgc cacctccaaa 26460 

atgcgaggtt tcctcctctt accttgaagt tcaaggcgtg gatgcctact gcctcataca 26520 

cattaactgc agcaactcta aatatecagc tagaatttac tataatggac aggaaagtaa 26580 

tcttttttat tatttaacaa caagcgctgg taacggtaaa cagttacctg actattttac 26640 

tgctgttgtt gaattttcca cc'tacagaga aacgtatgcc aagcggcctt aeaatttctc 26700 

atacccgttt: aacgaccttb gcaatgaaat acaagcgctc gaaactggaa ctgattttac 26760 

tccaattttc attgctgcca ttgttgtaag cttaattacc attattgtca gcctagcatt 26820

ttactgcttt tacaagccca aaaaccctaa gtttgaaaaa cttaaactaa aacc^gtcat 26880 

tcaacaagcg tgattttgtt ttccagcatg gtagctgcat ttctacttct cctctgtcta 26940 

cccatcattt tcgtctcttc aactttcgcc gcagtttccc acctggaacc agagtgccta 27000 

ccgccttttg acgtgtatct gattctcacc tttgtttgtt gtatatccat ttgcagtata 27060 

gcctgctttt ttataacaat: ctttcaagcc gccgactatt tttacgtgcg aattgcttac 27120 

tttagacacc atcctgaata cagaaatcaa aacgttgcct ccttiactttg tttggcatga 27180 

ttaagttatt gctgatactt aactatttac ccctaatcaa ctgtaattgt ccattcacca 27240 

aaccctggtc attctacacc tgttaitgata aaatccccga cactcctgtt gcttggcttt 27300 

acgcagccac cgccgctttg gtatttatat ctacttgcct tggagtaaaa ttgtatttta 27360 

ttttacacac tgggtggcta catcccagag aagatttacc tagatatcct cttgtaaacg 27420 

cttttcaatt acagcctctg cctcctcctg atcttcttcc tcgagctccc tctattgtga 27480 

gctactttca actcaccggt ggagatgact gactctcagg acattaatat tagtgtggaa 27540 
agaatagctg ctcagcgtca gcgac^aaacg cgagtgttgg aatacctgga actacagcaa 27600 
cttaaagagt cccactggtg tgagaaagga gtgctgtgcc atgttaagca ggcagccctt 27660

tcctacgatg tcagcgttca gggacatgaa ctgtcttaca ctttgccttt gcagaaacaa 27720 

accttctgca ccatgatggg ctctacctcc atcacaatca cccaacaagc cgggcctgta 27780 

gagggggcta tcctctgtca ctgtcacgca cctgattgca tgtccaaact aatcaaaact 27840 

ctctgtgctt taggtgatat ttttaaggtg taaatcaata ataaacttac cttaaatttg 27900 

acaacaaatt tctggtgaca tcat.tcagca gcaccacttt accctcttcc cagctctcgt 27960 

atgggatgcg atagtgggtg gcaaacttcc tccaaaccct aaaagaaata ttggtatcca 28020

cttccttgtc ctcacccaca attttcatct tttcatagat gaaaagaacc agagttgatg 28080 

aagacttcaa ccccgtctac cccdatgaca ccacaaccac tcctgcagtt ccctttatat 28140 

cacccccctt tgtaaacagc gatggtcttc aggaaaaccc cccaggtgtt ttaagtctgc 28200 

gaatagctaa acccctatat ttcgacatgg agagaaaact agcccttitca cttggaagag 28260 

ggttgacaat taccgccgcc ggagaattag aaagtacgca gagcgtacaa accaacccac 28320 

cgttgataat taccaacaac aacacactga ccctacgtca ttctcccccc ttaaacctaa 28380 

ctgacaatag cttagtgcta ggctactcga gtcctctccg cgtcacagac aacaaactta 28440 

catttaactt cacatcacca ctccgttatg aaaatgaaaa ccttactttt aactatacag 28500 

agcctcttaa acttataaat aacagccttg ccattgacat caattcctca aaaggcctta 28560 

gtagcgtcgg aggctcacta gctgtaaacc tgagttcaga cttaaagttt gacagcaacg 28620 

gatccatagc ttttggcata caaaccctgt ggaccgctcc gacctcgact ggcaactgca 28680 

ccgtctacag cgagggcgat tccctactta gtctctgttt aaccaaatgc ggagctcacg 28740 

tcttaggaag tgtaagttta accggbttaa caggaaccat aacccaaatg actgatattt 28800 

ctgtcaccat tcaatttaca tttg^caaca atggtaagct actaagctct ccacttataa 28860 

acaacgcctt tagtattcga cagaatgaca gtacggcctc aaaccctacc tacaacgccc 28920 

tggcgtttat gcctaacagt accatatatg caagaggggg aggtggtgaa ccacgaaaca 28980 

actactacgt ccaaacgtat cttaggggaa atgtbcaaaa accaatcatt cttactgtaa 29040 

cctacaactc agtcgccaca ggatattcct tatcttttaa gtggactgct cttgcacgtg 29100 

aaaagtttgc aaccccaaca acctcgtttt: gctacattac agaacaataa aaccgtgtac 29160 

cccaccgttt cgtttttttc agatgaaacg ggcgagagtt gatgaagact tcaacccagt 29220 

gtacccttat gaccccccac atgctqctgt tatgcccttc attactccac cttttacctc 29280 

ctcggatggg ttgcaggaaa aaccacttgg agtgttaagt ttaaactaca gagatcccat 29340 
tactacgcaa aatgagtctc ctacaattaa actaggaaac ggcctcactc tagacaacca 29400

gggacaacta acatcaaccg gtggcgaagt agaacctcca ctcactaacg ctaacaacaa 29460

acttgcactg gtctatagcg atcctttagc agtaaagcgc aacagcctaa ccttatcgca 29520

caccgctccc cttgttattg ctgataactc tttagcattg caagtttcag agcctatttt 29580

tataaatgac aaggacaaac tagccctgca aacagccgcg ccccttgtaa ctaacgctgg 29640

cacccttcgc ttacaaagcg ccgccccttt aggcattgca gaccaaaccc taaaactcct 29700

gtttaccaac cctttgtact tgcagaataa ctttctcacg ttagccattg aacgaccccc 29760

tgccattacc aatactggaa agctggctct acagctctcc ccaccgctac aaacagcaga 29820

cacaggcttg actttgcaaa ccaacgtgcc attaactgta agcaacggga ccctaggckt 29880

agccataaag cgcccactta ttattcagga caacaacttg tttttggact tcagagctcc 29940

cctgcgtcct ttcaacagcg acccagtact agggcttaac ttttacaccc ctcttgcggt 30000

acgcgatgag gcgctcactig ttaacacagg ccgcggcctc acagtgagtt acgatggttt 30060

aattttaaat cttggtaagg atcttcgctt tgacaacaac accgtttctg tcgctcttag 30120

tgctgctttg cctttacaat acactgatca gcttcgcctt aacgtgggcg ctgggctgcg 30180

ttacaatcca gtgagtaaga aattgfgacgb gaaccccaat caaaacaagg gtttaacctg 30240

ggaaaatgac tacctcattg taaagc.tagg aaatggatta ggttttgatg gcgatggaaa 30300

catagctgtt tctcctcaag ttacatcgcc tgacacctta tggaccactg ccgacccatc 30360

ccccaattgt tccatctaca ctgatttaga tgccaaaakg tggctctcgt tggtaaaaca 30420

agggggtgtg gttcacggtt ccgctgcttt aaaagcattg aaaggaaccc tattgagtcc 30480

tacggaaagc gccattgtua ttatactaca ttttgacaat tatggagtgc gaattctcaa 30540

ttatcccact ttgggcactc aaggcacgtt gggaaataat gcaacttggg gttataggca 30600

gggagaatct gcagacacta atgtactcaa tgcactagca tttatgccca gttcaaaaag 30660

gtacccaaga 999cgtggaa gcgaagttca gaatcaaact gtgggctaca cttgtataca 30720

gggtgacttt tctatgcccg tacqgtacca aatacagtac aactatggac caactggcta 30780

ctcctttaaa tttatttgga gaactgtttc aagacaacca tttgacatcc catgctgttt 30840

tttctcttac attacggaag aataaaacaa ctttttcttt ttattttctt ttta^ttttac 30900

acgcacagta aggcttcctc cacccttcca tctcacagca tacaccagcc tctccecct.t 30960

catggcagta aactgttgtg agtcagtcog gtatttggga gttaagatcc aaacagtctc 31020
tttggtgatg aaacatggat ccgtgatgga cacaaatccc tgggacaggt tctccaacgt 31080

titcggtaaaa aactgcatgc cgccctacaa aacaaacagg ttcaggctct ccacgggtta 31140 

tctccccgat caaactcaga cagagtaaag gtgcgatgat gttccactaa accacgcagg 31200 

tggcgctgtc tgaacctctc ggtgcgactc ctgtgaggct ggtaagaagt tagattgtcc 31260 

agcagcctca cagcatggat catcagtcta cgagtgcgtc tggcgcagca gpgcatctga 31320 

atctcactga gattccggca agaatcgcac accatcacaa tcaggttgtt catgatccca 31380

tagccgaaca cgctccagcc aaagctcatt cgctccaaca gcgccaccgc gtgtccgtcc 31440 

aaccttactt taacataaat caggtgtctg ccgcgtacaa acatgctacc cgcatacaga 31500 

acctcccggg gcaaacccct gttcaccacc tgcctgtacc agggaaacct cacatttatc 31560

agggagccat agatagccat tttaaaccaa ttagctaaca ccgccccacc agctctacae 31620 

tgaagagaac cgggagagtt acaatgacag tgaataatcc atctctcata acccctaatg 31680 

gtctgatgga aatccagatc taacgtggca cagcagatac acactttcat atacattttc 31740

atcacatgtt Lttcccaggc cgttaaaata caatcccaat acacgggcca ctcctgcagt 31800 

acaataaagc taatacaaga tggtajtactc ctcacctcac taacattgtg catgttcata 31860 

ttttcacatt ctaagtaccg agagttctcc tctacaacag cactgccgcg gtcctcacaa 31920 

ggtggtagct ggtgacgatt gtaaggagcc agtctgcagc gataccgtct gtcgcgttgc 31980

atcgtagacc agggaccgac gcacctcctc gtacttgtag tagcagaacc acgtccgctg 32040

ccagcacgtc tccaagtaac gccggtccct gcgtcgctca cgctccctcc tcaacgcaaa 32100 

gtgcaaccac tcttgtaatc cacacagatc cctctcggcc tccggggcga tgcacacctc 32160 

aaacctacag atgtctcggt acagttccaa acacgtagtg agggcgagtt ccaaccaaga 32220 

cagacagcct gatctatccc gacacactgg aggtggagga agacacggaa gaggcatgtt 32280 

attccaagcg attcaccaac gggtcgaaat gaagatcccg aagatgacaa cggtcgcctc 32340

cggagccctg atggaattta acagccagat caaacattat gcgattttcc aggctatcaa 32400 

tcgcggcctc caaaagagcc tggacccgca cttccacaaa caccagcaaa gcaaaagcgt 32460

tattatcaaa ctcttcgatc atcaagctgc aggactgtac aatgcccaag taattttcat 32520 

ttctccactc gcgaatgatg tcgcggcaaa tagtctgaag gttcatgccg tgcatattaa 32580 

aaagctccga aagggcgccc tctatagcca tgcgtagaca caccatcatg actgcaagat 32640 

atcgggctcc tgagacacct gcagcagatt taacagaccc aggtcaggtt gctctccgcg 32700 

atcgcgaatc tccatccgca aagtcatttg caaataatta aatagatctg cgccgactaa 32760 
atctgttaac tccgcgctag gaactaaatc aggtgtggct acgcagcaca aaagttccag 32820

ggatsgcgcc aaactcacta gaaccgctcc cgagtagcaa aactgatgaa tgggagtaac 32880

acagtgtaaa atgttcagcc aaaaatcact aagctgctcc tttaaaaagt ccagtacttc 32940

tatattcagt tcgtgcaagt actgaagcaa ctgtgcggga atatgcacag caaaaaaaat 33000

agggcggctc agatacatgt tgacctaaaa taaaaagaat cattaaacta aagaagcctg 33060

gcgaacggtg ggatatatga cacgctccag cagcaggcaa gcaaccggct gtccccggga 33120

accgcggtaa aattcatccg aatgattaaa aagaacaaca gagacttccc accatgtact 33180

cggttggatc tcctgagcac ag^gcaatac ccccctcaca ttcataticcg ctacagaaaa 33240

aaaacgtccc agatacccag cgggaatatc caacgacagc tgcaaagaca gcaaaacaat 33300

ccctctggga gcaatcacaa aatcctccgg tgaaaaaagc acatacatat tacfaataacc 33360

ctgttgctgg ggcaaaaagg cccgtcgtcc cagcaaatgc acataaatat gttcatcagc 33420

cattgccccg tcttaccgcg taaapagcca cgaaaaaatc gagctaaaat ccacQcaaca 33480

gcctatagcc atatatacac tccacccaat gacgctaata ccgcaccacc cacgaccaaa 33540

gttcacccac acccacaaaa cccgpgaaaa tccagcgccg tcagcac&cc cgcaatttca 33600

gtctcacaac gtcacttccg cgcgcctttt cactttccca cacacgccct tcgdccgccc 33660

gccctcgcgc caccccgcgt caccccacgt caccgcacgt caccccggcc ccgcctcgct 33720

cctccccgct cattatcata ttggc&cgtt tccagaataa ggtatattat tgatgcagca 33780

aaacaatccc tctgggagca atcacaaaat cctccggtga aaaaagcaca tacatattag 33840

aataaccctg ttgctggggc aaaaaggccc gtcgtcccag caaatgcaca taaatatgtt 33900

catcagccat tgccccgtct taccgcgtaa acagccacga aaaaatcgag ctaasiatcca 33960

cccaacagcc tatagctata tatacactcc acccaatgac gctaataccg caccacccac 34020

gaccaaagtt cacccacacc cacaaaaccc gcgaaaatcc agcgccgtca gcacttccgc 34080

aatttcagtc tcacaacgtc acttccgcgc gccttttcac tttcccacac acgccctitcg 34140

cccgcccgcc ctcgcgccac cccgcgtcac cccacgtcac cgcacgtcac ccdggccccg 34200

cctcgctcct ccccgctcat tatcatattg gcacgtttcc agaataaggt atattattga 34260

tgca 34264



<210> 5 
<211> 31044 
<212> DNA 
<213> simian adenovirus SV-25

<400> 5

catcatcaat aatatacctt attctggaaa cgtgccaata tgataatgag cggggaggag 60

cgaggcgggg ccggggtgac gtgcggtgac gcggggtggc gcgagggcgg ggcgaagggc 120

gcgggtgtgt gtgtgggagg cgcttagttt ttaegtatgc ggaaggaggt tttataccgg 180

aagatgggta atttgggcgt atacttgtaa gttttgtgta atttggcgcg aaaactgggt 240 

aatgaggaag ttgaggttaa tatgtacttt ttatgactgg gcggaatttc tgctgatcag 300

cagtgaactt tgggcgctga cggggaggtt tcgctacgtg acagtaccac gagaaggctc 360 

aaaggtccca tttattgtac tcttcagcgt tttcgctggg tatttaaacg ctgtcagatc 420 

atcaagaggc cactcttgag tgcjtggcgag aagagttttc tcctccgtgc tgccacgatg 480 

aggctggtcc ccgagatgta cggtgttttt agcgacgaga cggtgcgtaa ctcagatgac 540 

ctgctgaatt cagacgcgct ggaaatttcc aattcgcctg tgctttcgcc gccgtcactt 600 

cacgacctgt ttgtgttttg gctcaacgct tagcaacgtg ttatataggg tcaagaagga 660 

gcaggagacg cagtttgcta ggct^ttggc cgatactcct ggagtttttg tggctctgga 720

tctaggccat cactctcttt tccaagagaa aattatcaaa aacttaactt ttacgtctcc 780 

tggtcgcacg gttgcttccg ctgcctttat tacctatatt ttggatcaat ggagcaacag 840

cgacagccac ctgtcgtggg agtacatgct ggattacatg tcgatggcgc tgtggagggc 900

catgctgcgg aggagggttt gcatttactt gcgggcgcag cctccgcggc tggaccgagt 960 

ggaggaggag gacgagccgg gggagaccga gaacctgagg gccgggctgg accctccaac 1020

ggaggactag gtgctgagga tgateccgaa gaggggacta gt^gggctag gaagaagcaa 1080 

aagactgagt ctgaacctcg aaactttttg aatgagttga ctgtgagttt gatgaatcgt 1140 

cagcgtccgg agacaatttt ctggtctgaa ttggaggagg aattcaggag gggggaaccg 1200 

aacctgctat acaagtatgg gtttgaacag ttaaaaactc actggttgga gccgtgggag 1260 

gattttgaaa ccgccttgga cacttttgct aaagtggctc tgcggccgga taaggtttac 1320

actatccgcc gcactgttaa cataaagaag agtgtttatg ttataggcca tggagctctg 1380 

gtgcaggtgc aaaccgtcga ccgggtggcc tttagttgcg gtatgcaaaa tctgggcccc 1440

ggggtgatag gcttaaatgg tgtaacattt cacaatgtaa ggtttactgg tgaaagtttt 1500 

aacggctctg tgtttgcaaa taacacacag ctgacgctcc acggcgttta cttttttaac 1560 

tttaataaca catgtgtgga gtcgtggggc agggtgtctt tgaggggctg ctgttttcac 1620 
ggctgctgga aggcggtggt gggaagactt aaaagtgtaa catctgtaaa aaaatgegtg 1680

tttgagcggt gtgtgttgge tttaactgtg gagggctgtg gacgcattag glataatgcg 1740

gcgtctgaga atggatgttt tcttttgcta aaaggcacgg ctagtattaa gcataacatg 1800

atatgcggoa gcggtctgta cccttcacag ctgttaactt gcgcggatgg aaactgteag 1860

acettgcgca ccgtgcacab agcgtcccae cagcgccgcg cctggccaac attpgagcac 1920

aatatgctta tgcgttgtgc cgtccacttg ggccctaggo gaggegtgtt tgtgccttac 1980 

cagtgtaact ttagecatac caagatttta ctagaacctg ataccttctc tcgkgtgtgt 2040 

ttcaatgggg tgtttgacat gtcaatggaa ctgtttaaag tgataagata tgatgaatcc 2100 

aagtctcgtt gtcgcccatg tgaatgcgga gctaatcatc tgaggttgta tccfcgtaacc 2160 

ctaaacgtca ccgaggagct gaggacggat caccacatgt tgtcctgcct gcgcaccgac 2220 

tatgaatcca gcgacgagga gtgaggtgag gggcggagcc acaaagggta taaaggggcg 2280 

tgaggggtgg gtgtgatgat tda^aatgag cgggacgacg gacggcaacg cgtttgaggg 2340 

tggagtgttc agcccttatc tgacatctcg tcttccttcc tgggdaggag fgcgtcagaa 2400 

tgtagtgggc tccaccgtgg acggacgacc ggtcgcccct gcaaattccg ccaocctcac 2460 

ctatgccacc gtgggatcat cgttSfgacac cgccgcggca gotgccgett ctgctgccgc 2520

ttctaotgct cgcggcatgg cggctgattt cggactgtat aaccaactgg ccactgcagc 2580 

tgtggcgtct cggtctctgg ttc^gaaga tgcoctgaat gtgatcctga ctcgcctgga 2640 

gatcatgtca cgtcgcttgg acgaactgge tgcgcagata tcccaagota accccgatac 2700 

cacttcagaa tcctaaaata aagacaaaca aatatgttga aaagtaaaat ggctttattt 2760 

gttttttttg gctcggtagg ctcgggtcca cctgtctcgg tcgttaagaa ctttgtgtat 2820 

gttttccaaa acacggtaca gatgggcttg gatgttcaag tacatgggca tgaggccatc 2880 

tttggggtga agataggacc attgaagagc gtcatgctcc ggggtggtgt tgtaaattac 2940

ccagtcgl^ag cagggtttct gggcgtggaa ctggaagatg tcctttagga gtaggctgat 3000 

ggccaagggc aggcccttag tgtaggtgtt tacaaagcgg ttaagctggg agggatgcat 3060

gcggggggag atgatatgca Ccttggcttg gatcttgagg ttagctatgt t4ccacccag 3120 

gtctctgogg gggttcatgt tatgaaggac caccagcacg gtgtagccgg tgcatttggg 3180

gaacttgtca tgcagtttgg aggggaaggc gtggaagaat ttagagaccc cctpgtggcc 3240

ccctaggttt tccatgcact catccataat gatggcaatg ggacccctgg cggccgcttt 3300

ggcaaacacg ttttgggggt tggaaacatc atagttttgc totagagtga gctcatcata 3360 
ggccatctta acaaagcggg gtaggagggt gcccgactgg gggatgatag ttccatctgg 3420
gcctggggcg tagttaccct cacagatctg catctcccag gccttaattt ccgA^ggggg 3480
tatcatgtcc acctgggggg caataaagaa cacggtttct pgcgggggat tgatgagctg 3540
ggtggaaagc aagttacgca gcagfctgaga tttgccacag ccggtggggc egtagatgac 3600
cccgatgacg ggttgcagct ggtagttgag agaggaacag otgocgtcgg ggdgcaggag 3660
gggggctacc tcattcatea tgctcccaac atgtttattt tcactcacta agttttgcaa 3720
gagcctctcc ccacccaggg ataagagttc ttccaggctg ttgaagtgtt tcageggttt 3780
taggccgtcg gccatgggca tcttttcgag cgactgacga agcaagtaca gtcggtccoa 3840
gagctcggtg acgtgctcta tggaatctcg atccagcaga cttcttggtt gcgggggttg 3900
ggtcgacttt cgctgtaggg caccagccgg tgggcgtcca gggccgcgag ggtrt.ctgtcc 3960
ttccagggtc tcagcgtccg ggtga^tg gtctcggtga cggtgaaggg atgSgccccg 4020
93Ctgggcgc ttgcgagggt gcgcttcagg ctcatcctgc tggtgctgaa gcggacgtcg 4080
tctccctgtg agtcggccag atagcaacga agcatgaggt cgtagctgag ggactcggcc 4140
gcgtgtccct tggcgcgcag ctttcccttg gaaacgtgct gacatttggt gcagtgcaga 4200
cattggaggg cgtagagttt ggg^gccagg aagaccgact cgggcgagta ggc^tcggct 4260
ccgcactgag cgcagacggt ctcgcactcc actagccacg tgagctcggg tttagcggga 4320
tcaaaaaoca agttgcctcc attttttttQ atgcgtttct taccttgcgt ttccatgagt 4380
ttgtggcccg cttccgtgac aaaaaggctg tcggtgcetc cgtagacaga cttgaggggg 4440
cgatcttcca aaggtgttcc gaggtcttcc gcgtacagga actgggacca ctccgagacg 4500 
aaggctctgg tccaggctaa cacgaaggag gcaatotgcg aggggtatet gtcgttttca 4560 
atgagggggt ccaccttttc ca^gtgtgc agacacaggt cgtcctcctc cgcgtccacg 4620
aaggtgattg gcttgtaagt gtaggtcacg tgatctgcac cccccaaagg ggtataaaag 4680
ggggcgtgcc caecctctec gtcactttct tccgeatcgc tgtggaccag agccagctgt 4740
tcgggtgagt aggccctetc aaaagccggc atgatcccgg cgctcaagtt gtcagtttct 4800
acaaacgagg tggatttgat attcacgtgc cccgcggcga tgcttttgat ggtggagggg 4860
tccatctgat cagaaaacac gatctttttg ttgtcaagtt tggtggcgaa agacccgtag 4920
agggcgttgg aaagcaactt ggcgatggag cgcagggtct gatttttcte ccgatcggcc 4980 
ctctccttgg cggcgatgtt g^gttgcacg tactcccggg ccgcgcaccg ccactcgggg 5040
aacacggcgg tgcgctcgtc gggcaggatg cgcacgcgcc agccgcgatt gtgcagggtg 5100

atgaggtcca cgctggtagc cacctccccg cggaggggct cgttggtcca acacaatcgc 5160 

cccccttttc tggagcagaa cggaggcagg ggatctagca agttggcggg cggggggtcg 5220 

gcgtcgatigg tgaagatacc gggtagcagg atcttattaa aataatcgat ttcggtgtcc 5280 

gtgtcttgca aogcgtcttc ccacttcttc accgccaggg ccctttcgta gggattcagg 5340 

ggcggtcccc agggcatggg gtgggtcagg gccgaggcgt acatgccgca gatgtcatac 5400 

acgbacaggg gttccctcaa caccccgatg taagtggggt aacagcgccc ccccfcggatg 5460 

ctggctcgca cgtagtcgta catctcgcgc gagggagcca tgaggccgtc tcccaagtgg 5520 

gtcttgtggg gtttttcggc ccggtagagg atctgtctga agatggcgtg ggagttiggaa 5580 

gagatggtgg ggcgttggaa gsic^ttaaag ttggccccgg gtagtcccac ggagtcttgg 5640 

atgaactggg cgtaggattc ccggagtttg tccaccaggg cggcggtcac cagcacgtcg 5700 

agagcgcagt agCccaacgt cticgcggacc aggttgtagg ccgtctcttg ttttttctcc 5760 

cacagttcgc ggttgaggag gtattcctcg cggtctttcc agtactcttc ggcgggaaat 5820 

cctttttcgt ccgctcggta agaacctaac atgtaaaatt cgttcaccgc tttgtatgga 5880 

caacagcctt tttctaccgg cagggcgtac gcttgagcgg cctttctgag agaggtgcgg 5940 

gtgagggcga aggtgtcccg caccatcact ttcaggtact gatgtttgaa gtccgtgtcg 6000 

tcgcaggcgc cctgttccca cagcgtgaag tcggtgcgct ttttctgcct gggattgggg 6060 

agggcgaagg tgacatcgtt aaagagtatt ttcccggcgc ggggcatgaa gttgcgagag 6120 

atcctgaagg gcccgggcac gtccgagcgg ttgttgatga cctgcgccgc caggacgatc 6180

tcgtcgaagc cgttgatgtt gtgacccacg atgtaaagtt cgatgaagcg cggctgtccc 6240 

ttgagggccg gcgctttittt caactcctcg taggtgagac agtccggcga ggagagaccc 6300 

agctcagccc gggcccagtc ggagagttga ggattagccg caaggaagga gctccataga 6360

tccaaggcca ggagagtttg caagcggtcg cggaactcgc ggaacttttt ccccacggcc 6420 

attttctccg gtgtcactac gtaa^aggtg ttggggcggt tgttccacac gtcccatcgg 6480 

agctctaggg ccagctcgca ggcttggcga acgagggtct cctcgccaga gacgtgcatg 6540 

accagcataa agggtaccaa ctgtttcccg aacgagccca tccatgtgta ggtttctacg 6600 

tcgtaggtga caaagagccg ctgggtgcgc gcgtgggagc cgatcggaaa gaagctgatc 6660 

tcctgccacc agctggagga atgggtgtta atgcggtgga agtagaagtc ccgccggcgc 6720 

acagagcatt cgtgccgatg tttgftaaaag cgaccgcagt agtcgcagcg ctgcacgctc 6780
tgtatctcct gaacgagatg cgctcttcgc ccgogcacca gaaaccggag ggggaagttg 6840
agacgggggg ctggtggggc gacatcccct tcgccttggc ggtgggagtc tgcgfcctgcg 6900
ccctccttct ctgggtggac gacggtgggg acgacgacgc cccgggtgcc gcaagtccag 6960 
atctccgcca cggaggggtg caggcgitgc aggaggggac gcagctgccc gctgtccagg 7020 
gagtcgaggg aagtcgcgct gaggtcggcg ggaagcgttt gcaagttcac tttcagaaga 7080 
ccggtaagag cgtgagccag gtgcagatgg tacttgattt ccaggggggt gttggatgaa 7140
gcgtccacgg cgtagaggag tccgtgtccg ogcggggoca ccacegtgcc ccgaggaggt 7200
tttatctcac tcgtcgaggg cgab|S:gccgg ggggtagagg cggctctgcg ccgg^ggca 7260 
gcggaggcag aggcacgttt tcgtgaggat tcggcagcgg ttgatgacga gcccggagac 7320 
tgctggcgtg ggcgacgaog cggcggttga ggtcctggat gtgccgtctc tgcgtgaaga 7380
ccaccggccc ccgggtcctg aacctaaaga gagttccaca gaatcaatgt ctgcatcgtt 7440
aacggcggcc tgcctgagga tctcctgcac gtcgcccgag ttgtcctgat aggcgatctc 7500
ggccatgaac tgttccactt cttcctcgcg gaggtcaccg tggcccgctc gotccacggt 7560
ggcggccagg tegttggaga tgcggcgcat gagttgagag aaggcgttga ggccgttctc 7620
gttccacacg cggctgtaca ccac^tttcc gaaggagtcg cgcgctcgca tgaccacctg 7680
ggccacgttg agttccacgt ggcgggcgaa gacggcgtag tttctgaggc gctggaagag 7740 
gtagttgagc gtggtggcga tgtgctcgca gacgaagaag tacataatcc agcgccgcag 7800 
ggtcatctcg ttgatgtctc cgatggcttc gagacgctcc atggcctcgt agaagtogac 7860 
ggcgaagttg aaaaattggg agttgcigggo ggccaccgtg agttcttctt gcaggaggcg 7920 
gatgagaccg gcgaccgtgt cgcgcacctc ctgttcgaaa gcgccccgag gcgcctctgc 7980 
ttcttcctcc ggctcctcot cttccagggg ctcgggttcc tccggcagct ct^cgacggg 8040 
gacggggcgg cgacgtcgtc gcctgaccgg caggfcggtcc acgaagcgct cgatcattcc 8100 
gccgcgcogg cgacgcatgg tctcggtgac ggcgcgtccg ttttogcgag gtcgcagttc 8160 
gaagatgccg ccgcgcagag cgcccccgtg cagggagggt aagtggttag ggccgtcggg 8220 
cagggaoacg gcgetgacga tgcattttat caattgctgc gtaggcactc cgtgcaggga 8280 
tctgagaacg tcgaggtcga cgggatcoga gaacttctct aggaaagcgt ctatccaatc 8340 
gcaatcgcaa ggtaagctga gaaoggtggg tcgctggggg gcgttcgcgg gcagttggga 8400 
ggtgatgctg ctgatgaege aattaaagta ggcggtcttc aggcggcgga tggtggcgag 8460
gaggaccaeg tctttgggcc cggcctgttg aatgcgcagg cgctcggcca tgccccaggc 8520

etcgctctga cagcgacgca ggtctttgta gaagtcttgc atcagtctct ccaccggaae 8580 

ctctgcttct cccctgtctg ccatgcgagt cgagccgaac ccccgcaggg gctgcagcaa 8640 

cgctaggtcg gccacgaccc tttcggccag cacggcctgt tgaatcfgcg tgagggtggc 8700 

ctggaagtcg tccaggtcca cgaigcggtg ataggccccc gtgttgatgg tgta^tgca 8760 

gttggccatg acggaccagt tgacgacttg catgccgggt tgggtgatct c^tgtactt 8820 

gaggcgcgag taggccctgg actcgaacac gtagtcgttg catgtgcgca ecagatactg 8880 

gtagccgacc aggaagtgag gaggcggetc tcggtacagg ggccagccaa cggtggcggg 8940

ggcgccgggg gacaggtcgt ccagcatgag gcggtggtag tggtagatgt a^egggagag 9000 

ccaggtgatg ccggccgagg tggtegcggc cctggtgaat tcgcggacgc ggttccagat 9060 

gttgcgcagg ggaccaaagc getccatggc gggcacgctc tgccccgtga ggcgggcgca 9120 

atcttgtacg ctctagatgg aaaaaagaca gggcggtcat cgactccttt ccgtagcttg 9180

gggggtaaag tcgcaagggt gcggfeggcgg ggaaccccgg ttcgagaccg gccggatccg 9240 

cogctcccga tgcgcctggc cccgcatcca cgaegtccgc gccgagaccc agccgcgacg 9300 

ctccgcccca atacggaggg gagtcttttg gtgtetcttc gtagatgcat ccggtgctgc 9360 

ggcatgatgcg aecccagacg cccaetacca ccgccgtggc ggcagtaaac cizgagcggag 9420 

gcggtgacag ggaggaggaa g^gctggctt tagacctgga agagggagag gggctggccc 9480

ggctgggagc gccateccca gagagkcacc ctagggttca gctcgtgagg gacgccaggc 9540 

aggcttttgt gccgaagcag aacctgttta gggaccgcag cggtcaggag gcggaggaga 9600 

tgcgcgattg caggtttcgg gcgggcagag agctcagggc gggcttcgaC igggagcggc 9660 

tcctgagggc ggaggatttc gagcccgacg agcgttctgg ggtgagcccg gcccgcgctc 9720 

acgtatcggc ggccaacctg gtgagcgcgt acgagcagac ggtgaacgag gagcgcaact 9780 

tccaaaagag ctttaacaat cacgtgagga ccctgatcgc gagggaggag gtgaccatcg 9840 

ggctgatgca tctgtgggac ttcgtggagg cctacgtgca gaacccggct agcaaacccc 9900 

tgacggccca gctgttcctg atcgtgcagc acagccgcga caacgagacg ttccgcgacg 9960 

ccatgttgaa catcgcggag cecgagggtc gctggctctt ggatetgatt aacatcetgc 10020 

agagcatcgt ggtgcaggag aggggcctga gtttagcgga caaggtggcg gcikttaact 10080 

attcgatgca gagcctgggg aigttctacg ctcgcaagae ctacaagagc ccttacgtgc 10140

ccatagacaa ggaggtgaag atagacagct tttacatgcg catggcgctg aaggtgctga 10200 
egctgagcga cgacctcgge gtgtaccgta acgaoaagat ecacaaggcg gtgagcgeca 10260
gccgccggcg ggagctgagc g«agggagc tgatgcacag cctgcagagg gcgctggcgg 10320 
gcgccgggga cgaggagcgc gaggcttact tcgacatggg agccgatetg cagtggcgtc 10380 
ccagcgcgcg cgccttggag geggdg&gtt atcccgacga ggaggatcgg gacgatttgg 10440 
aggaggcagg cgagtacgag gacgaagcct gaccgggcag gtgttgtttt agatgcagcg 10500
gccggcggac gggaccaccg cggktcccgc acttttggca tccatgcaga gtcaaccttc 10560 
9ggcgtgacc gcctccgatg actgggcggc ggccatggac cgcatcatgg cgctgaccac 10620 
ccgcaacccc gaggotttta ggcagcaacc ccaggccaac cgtttttcgg ccatcttgga 10680 
agcggtggtg ccgtcgcgca ccaacccgac gcacgagaaa gtcctgacta tcgtgaacgc 10740 
cctggtagac agcaaggcca tccgccgtga cgaggcgggc ttgatttaca apgctctttt 10800 
ggaacgcgtg gcgcgctaca acagcactaa cgtgcagacc aatotggacc gcctcaccac 10860 
cgacgtgaag gaggcgctgg cgcagaagga gcggtttctg agggacagta atctgggctc 10920 
tctggtggca ctgaacgcct tcctgagctc acagccggcc aacgtgcccc gcgggcagga 10980 
ggattacgtg agctteatca ge^ctctgag actgctggtg tccgaggtgc ^ccagagcga 11040
ggtgtaccag tctgggccgg attacttttt coagacgtce cgacagggct tgcaaacggt 11100 
gaacctgact caggccttta aaaacttgca aggcatgtgg ggggtcaagg ccccggtggg 11160
cgatcgcgcc actatctcca gtctgctgac ceccaacact cgcctgctge tgdtcttgat 11220 
cgcaccgttt aocaacagta gcactatcag ccgtgactog tacctgggtc atctcatcac 11280 
tctgtaccgc gaggccatcg gccaggctca gatcgacgag catacgtatc aggagattac 11340 
taacgtgagc cgtgccctgg gtcaggaaga taccggoagc ctggaagcca egttgaactt 11400 
tttgctaacc aaccggagge aaaaaatacc ctcccagttc acgttaagcg cogaggagga 11460 
gaggattctg cgatacgtgc agcagtccgt gagcctgtac ttgatgogcg agggcgccac 11520
egcttccacg gctttagaca tgacggctcg gaacatggaa ccgtoctttt actccgccca 11580
ccggccgttc atcaacogte tgatggacta cttcoatcgc gcggccgcca tg^acgggga 11640
gtaettcace aatgocatcc tgaatccgca ttggatgccc ccgtccggct tctacaccgg 11700
ggagtttgac ctgccogaag ccgacgacgg ctttctgtgg gacgacgtgt ccgatagcat 11760 
tttcacgccg gctaatcgcc gattccagaa gaaggagggc ggagacgagc tccccctctc 11820 
cagcgtggaa gcggcctcaa ggggagagag tccctttcca agtctgtctt ccgccagtag 11880
oggtcgggta acgcgtecac ggttgccsgg ggagagogac tacctgaacg accccttgct 11940

gegaccggec agaaagaaaa atttecccaa taacggggtg gaaagcttgg tggataaaat 12000 

gaatcgttgg aagacgtacg cccaggagca gcgggagtgg gaggacagtc agccgcggoc 12060

gctggtaccg ccgcattggc gtcgccagag agaagacccg gacgactccg cagacgatag 12120 

tagcgtgttg gacctgggag ggagcggagc caaccccttt gctcacttgc aacccaaggg 12180 

gcgctcgage cgcctgtatt aataaaaaag acgcggaaac ttaccagagc caitggccaca 12240 

gcgtgtgtgc tttcttcctc tctttcttcc tcggcgcggc agaatgagaa gagcggtgag 12300 

agtcacgccg gcggcgtatg agggcccgcc cccttcttac gaaagcgtga tgggatcagc 12360 

gaacgtgccg gccacgctgg aggcgoctta cgttcctccc agatacctgg gacctacgga 12420 

gggcagaaaci agcatccgtt actccgagct ggcgcccctg tacgatacca ccaaggtgta 12480 

cctggtggac aacaagtcgg cggacatcgc ctccctgaat taccaaaacg atcacagtaa 12540

ctttctgact accgtggtgc agaacaatga cttcaccccg acggaggcgg gcacgcagac 12600 

cattaacttt gacgagcgtt ccdgctgggg cggtcagotg aaaacoatcc tgdacaccaa 12660 

catgcccaac atcaacgagt tcatgtccac caacaagttc agggctaagc tgatggtaga 12720 

aaaaagtaat gcggaaactc ggcagccccg atacgagtgg ttcgagttta ccattccaga 12780 

gggcaactat tccgaaaeta tgactatcga tctcatgaat aacgcgatcg tggacaatta 12840 

cctgcaagtg gggagacaga acggggtgct ggaaagcgac accggcgtga aa^cgatac 12900 

cagaaacttc cgactggggt gggatcccgt gaccaagctg gtgatgecag gcgtgtacac 12960 

eaacgaggct tttcacccgg acatc&tgct gctgccgggg tgcggtgtgg acttcactca 13020 

gagccgtttg agtaacctgt taggaattag aaagcgccgc cccttccaag agggctttca 13080

aatcatgtat gaggacctgg agggaggtaa tatacccgcc ttactggacg tgttfgaagta 13140 

egaagctagc ataeaaegcg ccaaagcgga gggtagagag attcggggag acacctttgc 13200 

ggtagctccc caggacctgg aaatagtgcc tttaactaaa gacagcaaag icagaagcta 13260 

caatattaea aacaacacga cg$acaccct gtatcggagc tggtttctgg cttacaacta 13320

cggagacccc gagaaaggag tgagatcatg gaccatactc accaccacgg acgtgacctg 13380 

tggctcgcag caagtgtact ggfeccctgcc ggatatgatg caagacccgg tcaccttccg 13440 

ccoctccacc caagtcagca acttcccggt ggtgggcacc gagctgotgc ccgtccatgc 13500 

caagagottc tacaacgagc aggecgtcta ctcgcaactt attcgccagt deaccgcgct 13560

tacccacgtg ttcaatcgct ttcccgagaa ccagatcctg gtgcgccctc ccgctcctac 13620 
cattaccacc gtcagtgaaa acgttcccgc cctcacagat cacggaacce tgccgctgcg 13680
cagcagtatc agtggagttc agcgcgtgac catcaccgac gccagacgtc gaacctgccc 13740 
ctacgtttac aaagcgcttg gcgtggtggc tcctaaagtt ctttctagtc gcaccttcta 13800 
aaaacatgtc oatcctcate tctcccgata aeaacaccgg ctggggactg ggctccggca 13860
agatgtacgg cggagccaaa aggcsitcoa gtcagcaccc agttcgagct cggggccact 13920 
tccgcgctcc ttggggagct ticaagcgag gactctcggg tcgaacggct gtagacgata 13980 
ccatagatge cgtgactgcc gacigcecgcc ggtacaaccc cggacoggtc gctagcgccg 14040
cctccaccgt ggattccgtg atcgacagcg tggtagccgg cgctcgggcc tatgctcgcc 14100 
gcaagaggcg gctgcatcgg agacgtcgcc ecaccgccgc catgctggca gccagggccg 14160
tgctgaggcg ggcccggagg gcafecagaa gggctatgcg ccgcgccgcc gccaacgccg 14220 
ccgccgggag ggcccgccga caggctgccc gccaggetgc cgctgccacc gctagcatgg 14280 
ccagacccag gagagggaac gtgtactggg tgcgtgattc tgtgacggga gtccgagtgc 14340 
cggtgcgcag ocgacctccc cgaagttaga agatccaagc tgcgaagacg gcggtactga 14400 
gtctccctgt tgttatcagc c6aacatgag caagcgcaag tttaaagaag aactgctgca 14460 
gacgctggtg cctgagatct atggccctcc ggacgtgaag ccagacatta agccccgcga 14520 
tatcaigcgt gttaaaaagc gggaaaagaa agaggaactc gcggtggtag aWatggcgg 14580 
agtggaattt attaggagtc tcgccccgcg acgcagggtt caatggaaag ^cggcgggt 14640 
aeaacgcgtt ttgaggccgg gcaccgcggt agtttttacc ccgggagagc ggtcggccgt 14700
taggggtttc aaaaggcagt acgacgaggt gtacggcgac gaggacatat tggaacagge 14760 
ggctcaacag atcggagaat ttgcctacgg aaagcgtteg cgtcgcgaag acctggccat 14820 
cgccttagac agcggcaacc ccacgcccag cctcaaaccc gtgacgctgc ^gcaggtgcc 14880 
tcccgtgagc gccagcacgg acagcaagag ggggattaag agagaaatgg aaga^tctrgea 14940 
tcccaccatc ciactoatgg tccctaaacg gcagaggctg gaagaggtcc tSgagaagat 15000 
gaaagtggac cccagcatag agccggatgt aaaagtcaga cctattaagg aa^tggcccc 15060 
cggtcttggg gtgcaaacgg tggackttca aatdcccgtc accaccgctt caaccgccgt 15120 
Sgaagctatg gaaacgcaaa cg;3^gacccc tgccgcgatc ggtaccaggg aagtggcgtt 15180 
gcaaacggag ccttggtacg aatacgcagc ccctcggcgt cagaggcgtt ccgctcgtta 15240
cggccccgec aacgccatca tgccagaata tgcgctgcat ccgtctattc tgiccactcc 15300 
eggataccgg ggtgtgacgt atcgdccgte tggaacccgc cgccgaaccc gtcgccgccg 15360
cegctcccgt cgcgctctgg cccccgtgtc ggtgcggcgt gtgacccgcc ggggaaagac 15420 
agtcgtcatt cccaacccgc gttaccaccc tagcatcctt taataactct gccgttttgc 15480 
agatggctct gacttgccgc gtgcgccttc ccgttccgca ctatcgagga agatctcgtc 15540 
gtaggagagg catgacgggc agtggtcgcc ggcgggcttt gcgcaggcgc atgaaiggcg 15600
gaaetttacc cgccctgata cccataattg ccgccgccat cggtgccata cccggcgttg 15660 
cttcagtggc gttgcaagca getcgtaata aataaacaaa ggcttttgoa cttatgacct 15720 
ggtoctgact attttatgca gaaagagcat ggaagacatc aattttacgt cgct^ctcc 15780 
gcggcacggc tcgcggccgc toatgggcae ctggaacgac atcggcacca gtcagctcaa 15840 
cgggggcgct ttcaattggg ggagcctttg gagcggcatt aaaaactttg gctccacgat 15900 
taaatcctac ggcagcaaag cctggaacag tagtgctggt cagatgctcc gagataaact 15960 
gaaggacacc aacttccaag aaaaagtggt caatggggtg gtgaccggca cccacggcgc 16020 
ggtagatctc gccaaccaag cggtgcagaa agagattgac aggcgtttgg aaagctcgcg 16080
ggtgccgccg cagagagggg atgaggtgga ggtcgaggaa gtagaagtag aggaaaagct 16140 
gcccccgctg gagaaagttc ccggtgcgcc tccgagaccg cagaagcggc ccaggccaga 16200 
actagaagag actctggtga cggagagcaa ggagcctccc tcgtacgag,: aagccttgaa 16260 
agagggcgcc tctccaccct cctacccgat gactaagccg atcgcaccca tggbtcgacc 16320 
ggtgtacggc aaggattaca agcccgtcac gctagagctg cccccaccgc cccccacgcg 16380 
cccgaccgtc ccccccctgc cgactccgtc ggcggccgcg gcgggacccg tgtccgcacc 16440 
atccgctg.:g cctctgccag ccgpccgtcc agtggccgtg gccactgcca gaa^ccocag 16500 
aggccagaga ggagccaact ggcaaagcac gctgaacagc a.cgtgggce tgggagtgaa 16560 
aagcctgaaa cgccgccgtt g.^tattatta aaaaagtgta gctaaaaagt ctcccgttgt 16620 
atacgcctcc tatgttaccg ccagagacga gtgactgtcg ccgcgagcgc cgctttcaag 16680 
atggccaccc catcgatgat gpcgcagtgg tcttacatgc acatcgccgg ccaggacgcc 16740
tcggagtacc tgagtcccgg cctcgtgcag tttgcccgcg ccaccgacac ctacttcagc 16800 
ttgggaaaca agtttagaaa ccccaccgtg gcccccaccc acgatgtgac cacggaccgc 16860 
tcgcagaggc tgaccctgcg ctttgtgccc gtagaccggg aggacaccge gtactcttac 16920 
aaagtgcgct acacgttggc cgttfggggac aaccgagtgc tggacatggc cagcacctac 16980 
tttgacatcc ggggggtgct gghtcggggt cccagcttca agccctattc cggcaccgct 17040 
tacaactccc tggcccccaa gggagctccc aacccctcgg aatggacgga cacttccgac 17100

aacaaacbta aagcatatgc tcaggctccc taccagagtc aaggacttac aaaggatggt 17160 

attcaggttg ggctagttgt gacagagtca ggacaaacac cccaatatgc aaacaaagtg 17220 

taccaacccg agccacaaad tggggaaaac caatggaatt tagaacaaga agataaagcg 17280

gcgggaagag tcctaaagaa agatacccct atgtttccct gctatgggtc atatgccagg 17340 

cccacaaacg aacaaggagg gcaggcaaaa aaccaagaag tagatttaca gttttttgcc 17400 

actccgggcg acacccagaa cacggctaaa gtggtacttt atgctgaaaa tgtcaacctg 17460 

gaaactccag atactcactt agtgtttaaa cccgatgacg acagcaccag ttcaaaactt 17520

cttcttgggc agcaggctgc acctaacaga cccaactaca taggttttag agataatttt 17580 

attggtttaa tgtactacaa tagcactgga aacatgggcg tgctggccgg acaggcctct 17640 

caattgaatg ccgtagtcga cttgcaggac agaaacaccg agttgtccta ccagctgatg 17700

ctggacgcac tgggggatcg cagccaatat ttttcaatgt ggaatcaggc agtagacagc 17760

tatgacccag acgttagaat tatagaaaac cacggagtgg aagacgaact gccaaactat 17820 

tgttttcctc tgggaggaat ggtggtgact gacaattaca actctgtgac gcctcaaaat 17880

ggaggcagtg gaaatacatg gcaggcagac aatactacat ttagtcaaag aggagcgcag 17940

attggctccg gaaacatgtt tgccctggaa attaacctac aggccaacct ctggcgcggc 18000 

ttcttgtatt ccaatattgg gttgtatctt ccagactctc tgaaaatcac ccccgacaac 18060 

atcacgctgc cagaaaacaa aaeLcacttat cagtacatga acggtcgcgt aacgccaccc 18120 

gggctcatag acacctatgt aaacgtgggc gcgcgctggt cccccgatgt catggacagc 18180 

attaacccct tcaaccacca ccgtaacgcg ggcttgcgct accgctccat ^ctcttgggc 18240 

aacggccgtt atgtgccttt tccicattcag gtgccccaaa aattctttgc catcaaaaac 18300

ctgctgcttc tccccggttc ctatacctat gagcggaacn tccgcaagga tgtcaacatg 18360 

atcctgcaga gctcgctggg taatfgacctg cgagtggacg gggccagcat acgctttgac 18420 

agcattaacc tgtatgccaa cttttctccc atggcccaca acacggcctc taccctggaa 18480 

gccatgctgc gcaacgacac caatgaccag tccttcaacg actacctgtg cgcggctaac 18540 

atgctgtacc ccatccccgc caacgccacc agcgtgccca tttctattcc ttctcggaac 18600 

tgggctgcct tcaggggctg gagttttact cgcctcaaaa ccaaggagac tccctcgctg 18660 

ggctccggtt ttgaccccta ctttgtttac tccggctcca ttccctacct agfttggcacc 18720 
ttttacctca aceacacttt caapaaggtg tctattatgt ttgactcctc ggttagctgg 18780

cccggcaacg accgcctgct aacgcccaac gagctcgaaa ttaagcgttc cgtggacggt 18840 

gaagggeaca acgtggccca gagcaacatg aceaaggact ggtttctaat toaaatgctc 18900 

agtcactata atataggtta ccagggcttc tatgtgcccg agaactacaa ggaccgcatg 18960 

tactccttct tccgcaactt ceaaccaatg agccggcagg tggtagatac cgtgacttat 19020 

acagactaca aagatgtcaa gctcccdtac caacacaaca actcagggtt cgtgggctac 19080 

atgggaccoa ccatgcgaga gggacaggco tacccggcca actatcccta cccictgatc 19140 

ggagagactg ccgtaoccag cctcacgcag aaaaagttcc tctgcgaccg ggtgatgtgg 19200 

aggataccct tctctagcaa ctktatgtcg atgggctccc tcaccgaect ggggcagaac 19260 

atgctgtacg ocaaotocgc tcapgccttg gacatgactt ttgaggtgga tcccatggat 19320 

gagcccacgc ttctctatgt t<;tgtttgaa gtcttcgacg tggtgcgcat ccabcagccg 19380 

caccgcggcg tcatcgaggc c^tctacctg cgcacacctt tctctgccgg taacgceacc 19440 

acctaaagaa gctgatgggt tccagcgaac aggagttgca ggccattgtt cgcgacctgg 19500 

gctgcgggcc ctgctttttg ggcaccttcg acaagcgttt tcccggatte atgtcccccc 19560 

acaagccggc ctgcgccatc gttaacaegg ccggacggga gacagggggg gtgcactggc 19620 

tcgccttcgc ctggaacccg cgcaaccgca cctgctacct gttcgaccct tttggtttct 19680

ccgacgaaag gctgaagcag atctaccaat tcgagtacga ggggctcctc aagcgcagcg 19740 

ctctggcctc cacgcccgac caot^cgtca ccctggaaaa gtccacccag acggtccagg 19800 

ggcccctctc ggccgcctgc gggctettct gttgcatgtt tttgcacgcc ttcgtgoact 19860 

ggcctcacac ecccatggag cgcaacccca ccatggatct gctcaccgga gtgcccaaca 19920 

gcatgcttca cagtccccag gtcgccocca ccctgcgtcg caatcaggac cacctgtatc 19980 

gctttctggg gaaacactct gcctatttcc gccgccaccg gcagcgcatc gaacaggcca 20040 

cggccttcga aagcatgagc caaagagtgt aatcaataaa aaccgttttt acttbacatg 20100 

atacgcgctt ctggcgtttt tattaaaaat cgaagggttc gagggagggg tcdtcgtgcc 20160 

^ctggggag ggacacgttg cggtactgga atcgggcgct ccaacgaaac tiggggatca 20220

ccagccgcgg cagggccacg tbttccatgt tctgcttcca aaactgtcgc accagctgca 20280 

gggctcccat cacgtcgggo gctgagatct tgaagtcgca gttagggccg gagcccccgc 20340 

ggctgttgcg gaacacgggg ttggcacact ggaacaccaa caegctgggg ttgtggatac 20400 

tagccagggc cgtcgggtcg gtcacctccg atgcatccag atcctcggca ttgctcaggg 20460 






cgaacggggt cagcttgcac atctgccgcc cgatctgggg taccaggtcg cgcttgttga 20520
ggcagtcgca gcgcagaggg atgaggatgc gacgctgccc gcgttgcatg atggggtaac 20580 
tcgccgccag gaactcctct atctgacgga aggccatctg ggccttgacg ccctcggtga 20640
aaaatagccc acaggacttg ctggaaaaca cgttattgcc acagttgatg tcttqcgcgc 20700 
agcagcgcgc atcttcgttc ttcagctgaa ccacgttgcg accccagcgg ttctgaacca 20760 
ccttggcttt cgtgggatgc tccttcagcg cccgctgtcc gttctcgctg gtcacatcca 20820 
tttccaccac gtgctccttg cagaccatct ccactccgtg gaaacagaac agaatgccct 20880 
cctgttgggt attgcgatgc tcccacacgg cgcacccggt ggactcccag ctcttgtgtt 20940 
tcacccccgc gtaggcttcc atgtaagcca ttagaaatct gcccatcagc tcagtgaagg 21000 
tcttctggtt ggtgaaggtt agcggcaggc cgcggtgttc ctcgttcaac caagtttgac 21060 
agatcttgcg gtacacggct ccctggtcgg gcagaaactt aaaagtcgtt ctgctctcgt 21120 
tgtccacgtg gaacttctcc accaacatcg tcatgacttc cacgcccttc tcccaggcag 21180
tcaccagcgg cgcgctctcg gggttcttca ccaacacggc ggtggagggg ccctcgccgg 21240 
ccccgacgtc cttcatggac attttttgaa actccacggt gccgtccgcg cggcgtactc 21300 
tgcgcatcgg agggtagctg aagcccacct ccatgacggt gctttcgccc tcgetgtcgg 21360 
agacgatctc cggggagggc ggcggaacgg gggcagactt gcgagccttc ttcttgggag 21420 
ggagcggagg cacctcctgc tcgcgctcgg gactcatctc ccgcaagtag ggggtgatgg 21480 
agcttcctgg ttggttctga cggttggcca ttgtatccta ggcagaaaga catggagctt 21540 
atgcgcgagg aaactttaac cgccccgtcc cccgtcagcg acgaagaggt catcgtcgaa 21600 
caggacccgg gctacgttac gccgcccgag gatctggagg ggcccttaga cgaccggcgc 21660 
gacgctagtg agcggcagga aaatgagaaa gaggaggagg agggctgcta cctcctggaa 21720 
ggcgacgttt tgctaaagca tttcgccagg cagagcacca tactcaagga ggccttgcaa 21780 
gaccgctccg aggtgccctt ggacgtcgcc gcgctctccc aggcctacga ggcgaacctt 21840 
ttctcgcccc gagtgcctcc gaagagacag cccaacggca cctgcgagcc caacccgcga 21900 
ctcaacttct accccgtgtt cgccgtgccc gaggcgctgg ccacctacca catctttttc 21960 
aaaaaccagc gcattcccct ttcctgccgg gccaaccgca ccgcggccga taggaagcta 22020 
acactcagaa acggagtcag catacctgat atcacgtcac tggaggaagt gcctaagatc 22080
ttcgagggtc tgggtcgaga tgagaagcgg gcggcgaacg ctctgcagaa agaacagaaa 22140 
gagagtcaga acgtgctggt ggagctggag ggggacaacg cgcgtctgac cgtcctcaaa 22200
cgttgcatag aagtttccca cttcgpctac ccggccctca acctgccgcc caaagttatg 22260 
aaatcggtca tggaccagct actcatcaag agagctgagc ccctgaatcc cgaccaccct 22320 
gaggcggaaa actcagagga cggaaagccc gtcgtcagcg acgaggagct cgagcggtgg 22380
ctggaaacca gggaccccca gcagttgcaa gagaggcgca agatgatgat ggcggccgtg 22440
ctggtcacgg tggagctaga atgcctgcaa cggtttttca gcgacgcgga gkcgctacgc 22500 
aaaatcgggg agtccctgca ctacaccttc cgccagggct aegttcgcca ggcctgcaaa 22560 
atctccaacg tagagctcag caacctggtt tcctacatgg gcatcctcca cgagaaccgg 22620 
ctggggcaga gcgtgctgca ctgcaccttg caaggcgagg cgcgaaggga ctacgtccga 22680 
gactgcgtct acctcttccc caccctcacc tggcagaccg ceatgggcgt gtggcagcag 22740 
tgcttggaag agagaaacct caaagagctg gacaaactcc tctgccgcca gcggcgggcc 22800 
ctctggaccg gctccagcga gcgcacggtc gcctgcgccc tggcagacat cattttccca 22860 
gaacgcctga tgaaaacctt gcagaacggc ctgccggatt tcatcagtca gagcatcttg 22920 
caaaacttcc gctccttcgt cctggagcgc tccgggatct tgcccgccat gagctgcgcg 22980 
ctgccctctg actttgtccc cctttcctac cgcgagtgcc ctcccccact gtggagccac 23040 
tgctacctct tccaactggc caactttctg gcctaccact ccgacctcat ggkgacgtg 23100 
agcggagagg ggctgctcga gtgccactgc cgctgcaacc tctgcacccc ccaiagatcg 23160 
ctggcctgca acaccgagct gctcagcgaa acccaggtca taggtaccet cgagatccag 23220 
gggccccagc agcaagaggg tgcttccggc ttgaagctca ctccggcgct gtggacctcg 23280 
gcttacttac gcaaatttgt agccgaggac taccacgccc acaaaattca gttttacgaa 23340 
gaccaatctc gaccaccgaa agctcccctc acggcccgcg tcatcaccca gagcaaaatc 23400 
ccggcccaat tgcaatccat caaccaagcg cgccgagatt tccttttgaa aaagggtcgg 23460 
ggggtgtacc tggaccccca gaccggcgag gaactcaacc cgtccacact ttccgtcgaa 23520
gcagcccccc cgagacatgc cacccaaggg aaccgccaag cagctgatcg etcggcagag 23580
agcgaagaag caagagctgc tccagcagca ggtggaggac gaggaagage tgtg^gacag 23640 
ccaggoagag gaggtgtcag aggacgagga ggagatggaa agctgggaca gcctagacga 23700 
ggaggacgag ctttcagagg aaga^gcgac cgaagaaaaa ccacctgcat ccagcgcgcc 23760 
ttctctgagc cgacagccga agccccggcc cccgacgccc ccggccggct caccaaagc 23820
cagccgtagg tgggacgcca ccggatctcc agcggcagcg gcaacggcag cgggt^aaggc 23880 
caaacgcgag cggcgggggt attgctcctg gcggacccac aaaagcagta tcgtgaactg 23940
cttgcaacac tgcgggggaa acatctcctt tgcccgacgc tacctcctct tccatcacgg 24000 
tgtggccttc cctcgcaacg ttctctatta ttaccgtcat ctctacagcc cctacgaaac 24060 
gctcggagaa aaaagctaag goctcctotg cogcgaggaa aaactccgcc gccgctgccg 24120 
ccaaggatcc gccggccace gagWtga gaaagcgcat ctttcccact ctgtatgcta 24180 
tctttcagca aagccgoggg cagcaccetc agcgogaact gaaaataaaa aaccgctcct 24240 
tccgctcact cacccgcagc tgtctgtacc acaagagaga agaccagctg cagcgcaccc 24300 
tggacgacgc cgaagcactg ttcagcaaat actgctcagc gtctcttaaa gactaaaaga 24360 
cccgcgctct ttccccctcg ggcgccaaaa cccacgtcat egccagcatg agcaaggaga 24420 
ttcccacccc ttacacgtgg agctatcagc cccagatggg cctggccgcg ggggccgccc 24480 
aggactactc cagcaaaatg aactggctca gcgccggccc ccacatgatc tcacgagtta 24540 
acggcatccg agcccaccga aiccagatcc' tcttagaaca ggcggcaatc accgccacac 24600 
cccggcgcca actcaaeccg cccagttggc ccgccgccca 99tgtatcag gaaactcccc 24660 
gcccgaccae agtcctcctg ccacgcgacg cggaggccga agtcctcatg actaactctg 24720 
gggtacaatt agcgggcggg tccaggtacg ccaggtacag aggtcgggcc gctcottact 24780
ctcccgggag tataaagagg gtgatcattc gaggccgagg tatccagctc aabgairgagg 24840 
cggtgagctc ctcaaccggt ctcagacctg acggagtctt ccagctcgga ggagcgggcc 24900 
gctcttcctt caccactcgc caggcccacc tgaccctgca gagctcttcc tcgcagccgc 24960
gctccggggg aatcggcact ccccagttcg tggaagagtt cgccccctcc gtctacttca 25020 
acccgttttc cggctcacct ggacgctacc cggacgcctt cattcccaac ttigacgcag 25080
tgagtgaatc cgtggacggc tacigactgat gacagatggt gcggccgtga g^gctcggct 25140 
gcgacatctg catcactgcc gccagcctcg ctgetacgct cgggaggcga tcgtgttcag 25200 
ctactttgag ctgccggacg agcaccctea gggaccggct cacgggttga aacccgagat 25260 
tgagaacgcg ctcgagtctc acctcaccga cgccttcacc gcccggcctc tcctggtaga 25320
aaccgaacgc gggatcacta ccatcaccct gttctgeatc tgccccacgc ccggattaea 25380 
tgaagatctg tgttgtcatc tttgcgctca gtttaataaa aactgaactt tttgcogtac 25440 
cttcaacgcc acgcgttgtt tctccttgtg aaaaaaeccc aggagtcctt aacttacaca 25500 
tagcaaaacc cttgtatttt accatagaaa aacaactagc cctttcaatt ggaaaagggt 25560 
taacaatttc tgctacagga cagttggaaa gcacagcaag cgtacaggac agcgctacac 25620
cacccctacg tggtatttcc cctttaaagc tgacs^gacaa cggtttaaca ttaagctatt 25680 
cagatcccct gcgtgtggta ggtga^caac ttacgtttaa ttttacttct ccactacgtt 25740 
acgaaaatgg cagtcttaca ttcaactaca cttctcccat gacactaata aacaacagtc 25800 
ttgctattaa cgtcaatacc tcca^aggcc tcagtagtga caacggcaca ct.cg'ctgtaa 25860 
atgttactcc agattttaga tttaacagct ctggtgcctt aacttttggc atacaaagtc 25920 
tatggacttt tccaaccaaa actectaact gtaccgtgtt taccgaaagt gactccctgc 25980 
tgagtctttg cttgactaaa tgqggagctc acgtacttgg aagcgtgagt ttaagcggag 26040 
tggcaggaac catgctaaaa atgacccaca cttctgttac cgttcagttt tcgtttgatg 26100 
acagtggtaa actaatattc tctccacttg cgaacaacac ttggggtgtt cgacaaagcg 26160 
agagtccgtt gcccaaccca tccttcaacg ctctcacgtt tatgccaaac agtaccattt 26220
attctagagg agcaagtaac gaacctcaaa acaattatta tgtccagacg tatcttagag 26280 
gcaacgtgcg aaagccaatt ctactaactg ttacctacaa ctcagttaat tcaggatatt 26340
ccttaacttt taaatgggat gctgtcgcca atgaaaaatt tgccactcct acatcttcgt 26400 
tttgctatgt tgcagagcaa taaaaccctg ttaccccacc gtctcgtttt tttcagatga 26460 
aacgagcgag agttgatgaa gacttcaacc cagtgtaccc ttatgacccc ccatacgctc 26520 
ccgtcatgcc cttcantact ccgcctttta cctcctcgga tgggttgcag gaaaaaccac 26580
ttggagtgtt aagtttaaac tacagggatc ccattactac acaaaatggg tctctcacgt 26640 
taaaactagg aaacggcctc actctaaaca accagggaca gctaacatca actgctggcg 26700 
aagtggagcc tccgctcact aatgctaaca acaaacttgc actagcctat agcgaaccat 26760 
tagcagtaaa aagcaaccgc ctaactctat cacacaccgc tccccttgtc atcgctaata 26820 
attctttagc gctgcaagtt tcagagccta tctttgtaaa tgacgatgac aagctagccc 26880 
tgcagacagc cgcccccctt gtaaccaacg ctggcaccct tcgcttacag agcgctgccc 26940 
ctttaggatt ggttgaaaat actcttaaac tgctgttttc taaacccttg tatttgcaaa 27000 
atgattttct tgcattagcc attgaacgcc ccctggctgt agcagccgca g^tactctga 27060 
ccctacaact tactcctcca ttaaagacta acgatgacgg gctaacacta tccaqagtcg 27120 
agccattaac tgtaaaaaac ggaaacctag gcttgcaaat atcgcgccct ttagttgttc 27180 
aaaacaacgg cctttcgctt gctattaccc ccccgctgcg tttgtttaac agcgaccccg 27240 
ttcttggttt gggcttcact tttcccctag ctgtcacaaa caacctcctc tccttaaaca 27300 
tS99^9^C99 agttaaactt acctataata aactaacagc caatttgggt agggatttac 27360

aatttgaaaa cggtgcgatt gccgtaacgc ttactgccga attacctttg caatacacta 27420 

acaaacttca actgaatatt ggagctggcc ttcgttacaa tggagccagc agaaaactag 27480 

atgtaaacat taaccaaaat aaaggcttaa cttgggacaa cgatgcagtt ^ttcccaaac 27540

taggatcggg cttacaattt gaccctaatg gcaacatcgc tgttatccct gaaaccgtga 27600 

agccgcaaac gttatggacg actgcagatc cctcgcctaa ctgctcagtg taccaggact 27660 

tggatgccag gctgtggctc gctcttgtta aaagtggcga catggtgcat ggaagcattg 27720 

ccctaaaagc cctaaaaggg acgttgctaa atcctacagc cagctacatt tccattgtga 27780 

tatattttta cagcaacgga gtcaggcgta ccaactatcc aacgtttgac aacgaaggca 27840

ccttiagctaa cagcgccact tggggatacc gacaggggca atctgctaac actaatgtga 27900 

ccaatgccac tgaatttatg cccagctcaa gcaggtaccc cgtgaataaa ggagacaaca 27960

ttcaaaatca atctttttca tacacctgta ttaaaggaga ttttgctatg cctgtcccgt 28020 

tccgtgtaac atataatcac gccctggaag ggcattccct taagttcacc tggcgcgttg 28080 

tagccaatca ggcctttgat attccttgct gttcattttc atacatcaca gaataaaaaa 28140 

ccactttttc attttaattt ctttttattt tacacgaaca gtgagacttc ctccaccctt 28200

ccatttgaca gcatacacca gcctctcccc cttcatagca gtaaactgtt gtgaatcagt 28260 

ccggtatttg ggagttaaaa tccaaacagt ctctttggtg atgaaacgtc gatcagtaat 28320 

ggacacaaat ccctgggaca ggttttccaa cgtttcggtg aaaaactgca caccgcccta 28380 

caaaacaaac aggttcaggc tctccacggg ttatctcccc gatcaaactc agacagggta 28440 

aaggtgcggt ggtgttccac taaaccacgc aggtggcgct gtctgaacct ctcggtgcga 28500

ctcctgtgag gctggtaaga agttagattg tcdagtagcc tcacagcatg tatc^tcagt 28560 

ctacgagtgc gtictggcgca gcagcgcatc tgaatctcac tgagattccg gcaagaatcg 28620 

cacaccatca caatcaggtt gttcatgatc ccatagctga acacgctcca gccaaagctc 28680 

attcgctcca acagcgccac cgcgtgtccg tccaacctta ctttaacata aatcaggtgt 28740 

ctgccgcgta caaacatgct acccacatac agaacttccc ggggcaggcc cctgttcacc 28800 
acctgtctgt accagggaaa cctcacattt atcagggagc catagatggc cattttaaac 28860 
caattagcta ataccgcccc accagctcta cactgaagag aaccgggaga gttacaatga 28920 
cagtgaataa tccatctctc ataacccctg atggtctgat gaaaatctag atctaacgtg 28980 
gcacaacaaa tacacacttt catatapatt ttcataacat gtttttccca ggccgttaaa 29040
atacaatcec aatacacggg ccactcctgc agtacaataa agctaataca agatggtata 29100 
ctcctcacct cactgacact gtgcatgttc atattttcac attccaagta ccgagagttc 29160 
tcct'ctacag cagcactgct gc^gtcctca caaggtggta gctggtgatg attgtagggg 29220 
gccagtctgc agcgataccg tctgtcgcgt tgcatcgtag accaggaacc gacgcacctc 29280 
ctcgtacttg tggtagcaga accacgtccg ctgccagcac gtctccacgt aacgccggtc 29340 
cctgcgtcgc tcacgctccc tcctcciatgc aaagtgcaiac cactcttgta atccacacag 29400 
atccctctcg gcctccgggg tgatgcacac ctcaaaccta cagatgtctc ggtacagttc 29460 
caaacacgta gtgagggcga gttccaacca agacagacag cctgatctac cccgacacac 29520 
tggaggtgga ggaagacacg gaagaggcat gttattccaa gcgattcacc aacgggtcga 29580 
aatgaagatc ccgaagatga caacggtcgc ctccggagcc ctgatggaat ttaacagcca 29640
gatcaaacgt tatgcgattc tccaagctat cgaccgccgc ttccaaaaga gcctggaccc 29700 
gcacttccac aaacaccagc aaagcaaaag cactattatc aaactcttca atcaccaagc 29760 
tgcaggactg tacaatgcct aagtaatttt cgtttctcca ctcgcgaatg atgtcgcggc 29820 
agatagtctg aaggttcatc ccgtgcaggg taaaaagctc cgaaagggcg ccctctacag 29880 
ccatgcgtag acacaccatc atgactgcaa gatatcgggc tcctgagaca cctgcagcag 29940 
atttaacaga tcaaggtcag gttgctctcc gcgatcacga atctccatcc gcaaggtcat 30000
ttgcaaaaaa ttaaataaat ctatgccgac tagatctgtc aactccgcat taggaaccaa 30060 
atcaggtgtg gctacgcagc acaaaagttc cagggatggt gccaaactca ctagaaccgc 30120 
tcccgagtaa caaaactgat gaatgggagt aacacagtgt aaaatgtgca accaaaaatc 30180 
actaaggtgc tcctttaaaa agtccagtac ttctatattc agtccgtgca agtactgaag 30240 
caactgtgcg ggaatatgca caacaaaaaa aatagggcgg ctcagataca tgttgaccta 30300 
aaataaaaag aatcattaaa ctaaagaagc ttggcgaacg gtgggataaa tgacacgctc 30360 
cagcagcaga caggcaaccg gctgtccccg ggaaccgcgg taaaat^tcat ccgaatgatt 30420
aaaaagaaca acagaaactt cccaccatgt actcggttgg atctcctgag cacacagcaa 30480 
tacccccctc acattcatgt ccgccacaga aaaaa^acgt cccagatacc cagcggggat 30540
atccaacgac agctgcaaag acagcaaaac aatccctctg ggagcgatca caaaatcctc 30600 
cggtgaaaaa agcacataca tattagaata accctgttgc tggggcaaaa aggcccggcg 30660 
tcccagcaaa tgcacataaa tatgttcatc agccattgcc ccgtcttacc gcgta^tcag 30720 
ccacgaaaaa atcgagctaa aattcaccca acagcctata gctatatata cactccgccc 30780

aatgacgcta ataccgcacc acccacgacc aaagttcacc cacacccaca aaacccgcga 30840 

aaatccagcg ccgtcagcac ttccgcaatt tcagtctcac aacgtcactt ccgcgcgcct 30900 

tttcacattc ccacacacac ccgcgccctt cgccccgccc tcgcgccacc ccgcgtcacc 30960

gcacgtcacc ccggccccgc ctcgctcctc cccgctcatt atcatattgg cacgtttcca 31020 

gaataaggta tattattgat gatg 31044 

<210> 6 
<211> 34115
<212> DNA

<213> simian adenovirus SV-39 
<400> 6 

catcatcaat ataacaccgc aagatggcga ccgagttaac atgcaaatga ggtgggcgga 60 

gttacgcgac ctttgtcttg ggaacgcgga agtgggcgcg gcgggtttcg gggaggagcg 120 

cggggcgggg cgggcgtgtc gcgcggcggt gacgcgccgg ggacccggaa attgagtagt 180 

ttttattcat tttgcaagtt tttctgtaca ttttggcgcg aaaactgaaa cgaggaagtg 240 

aaaagtgaaa aatgccgagg tagtcaccgg gtggagatct gacctttgcc gtgtggagtt 300

tacccgctga cgtgtgggtt tcggtctcta ttttttc^ct gtggttttcc gggtacggtc 360

aaaggtcccc attttatgac tccacgtcag ctgatcgcta gggtatttaa tgcgcctcag 420 

accgtcaaga ggccactcct gagtgccggc gagaagagtt ttctcctccg cgttccgcca 480

actgtgaaaa aatgaggaac ttcttgctat ctccggggct gccagcgacc gtagccgccg 540 

agctgttgga ggacattgtt accggagctc tgggagacga tcctcaggtg atttctcact 600 

tttgt'gaaga ttttagtctt catgatbtct atgatattga tccgggtgtt gaggggcaag 660 

aggatgaatg gctggagtct gtggatgggt tttttccgga cgctatgctg ctagaggctg 720

atttgccacc acctcacaac tctcacactg agcccgagtc agctgctatt cctgaattgt 780 

catcaggtga acttgacttg gcttgttacg agactatgcc tccggagtcg gatgaggagg 840

acagcgggat cagcgatccc acggctttta tggtctctaa ggcgattgct ataccaaaag 900 

aagatgatga tggcgatgat ggatttcgac tggacgctcc ggcggtgccg gggagagact 960 

gtaagtcctg tgaataccac cgggatcgta ccggagaccc gtctatgttg tgttctctgt 1020 

gttatctccg tcttaacgct gcttttgtct acagtaagtg ttttgtgctt ttttaccctg 1080

tggctttgtt gagtttattt ttttctgtgt ctcatagggt gttgtttatt ataggtcctg 1140 
tttcagatgt ggaggaacct gatagtacta ctggaaatga ggaggaaaag ccctccccgc 1200

cgaaactaac tcagcgctgc agacctaata ttttgagacc ctcggcccag cgtgtgtcat 1260 

cccggaaacg tgctgctgtt aatdgcatag aagatttatt g^aagagccc actgaacctt 1320 

tggacttgtc cttaaagcga ccccgcccgc agtagggcgc ggtgccagtt ttttctctct 1380 

agcttccggg tgactcagtg caataaaaat tttcttggca acaggtgtat gtgtttactt 1440 

tacgggcggg aagggattag gggagtataa agctggaggg gaaaaatctg aggctgtcag 1500 

atcgagtgag aagttccatg gacttgtacg agagcctaga gaatctaagt tctttgcgac 1560 

gtttgctgga ggaggcctcc gacagaacct cttacatttg gaggtttctg ttcggttccc 1620 

ctctgagtcg ctttttgcac cgggtgaagc gagagcacct gacggaattt gatgggcttt 1680 

tagagcagct gcctggactg tttgattctt tgaatctcgg ccaccggacg ctgctagagg 1740 

agaggctttt Cccacaattg gact^ttcct ctccaggccg tctgtgttica gcgcttgctt 1800 

ttgctgtaca tctgttggac agatggaacg agcagacgca gctcagcccg ggttacactc 1860 

tggacttcct gacgctatgc ctatggaagt tcggaatcag gagggggagg aaigctgtacg 1920 

ggcgcttggt ggagaggcat ccgtctctgc gccagcagcg tctgcaagct caagtgctgc 1980 

tgaggcggga ggatctggaa .gccatrttcgg aggaggagag cggcatggaa gagaagaat.c 2040 

cgagagcggg gctggaccct ccggcggagg agtagggggg ataccggacc cttLtticctga 2100 

gttggctttg ggggcggtgg ggggcgcttc tgtggtacgt gaggatgaag aggggcgcca 2160 

acgcggtcag aagagggagc attttgagtc ctcgactttc ttggctgatg taaccgtggc 2220 

cctgatggcg aaaaacaggc tggaggtggt gtggtacccg gaagtatggg aggactttga 2280 

gaagggggac ttgcacctgc tggaaaaatia taactttgag caggtgaaaa catactggat 2340 

gaacccggat gaggactggg aggtggtttt gaaccgatac ggcaaggtag ctctgcgtcc 2400 
cgactgtcgc taccaggttc gcgacaaggt ggtcctgcga cgcaacgtgt acctgttggg 2460 
caacggcgcc accgtggaga tggtggaccc cagaaggggt ggttttgtgg ccaatatgca 2520 
agaaatgtgc cctggggtgg tgggcttgtc tggggtgact tttcatagtg tgaggtttag 2580 
cggtagcaat tttgggggtg tggttattac cgcgaacact cctgtggtcc tgcataattg 2640 
ctactttttt ggcttcagca acacctgtgt ggaaatgagg gtgggaggca aagtgcgcgg 2700 
gtgttccttt tacgct^gct ggaagggggt ggtgagccag ggtaaggcta a^gt^tctgt 2760 
tcacaagtgt atgttggaga gatgcacctt gggcatttcc agtgagggct tcctccacgc 2820 
cagcgacaac gtggcttctg acaacggctg cgcctttctt atcaagggag ggggtcgcat 2880

ctgtcacaac atgatatgcg gccctgggga tgtcccccca aagccttacc agatggttac 2940 

Gtgcacagat ggcaaggtgc gcatgctcaa gcctgtgcac attgtgggcc accggcgcca 3000 

ccgctggcca gagtttgaac acaatgtgat gacccgctgt agcttgtacc tgggaggcag 3060 

gcgaggagtt ttcttgccca gacagtgtaa cctggcccac tgcaacgtga tcatggaaca 3120 

atccgccgct acccaggttt gctttggagg aatatttgat ataagcatgg tggtgtataa 3180 

gatcctgcgc tacgacgact gtcgggctcg tactcgaacc tgcgactgcg gagcctctca 3240

cctgtgtaac ctgactgtga tggggatggt gactgaggag gtgcgactgg accactgtca 3300 

gcactcttgc ctgcgggagg agttttcttc ctcggacgag gaggactagg tag^tggttg 3360 

^ggcgtggcc agcgagaggg tgggctataa aggggaggtg tcggctgacg ctgtcttctg 3420 

tttttcaggt accatgagcg gatcaagcag ccagaccgcg ctgagcttcg acggggccgt 3480 

gtacagcccc tttctgacgg ggcgcttgcc tgcctgggcc ggagtgcgtc agaatgttac 3540 

cggttcgacc gtggacggac gtcccgtgga tccatctaac gctgcttcta tgcgctacgc 3600 

tactatcagc acatctactc tggacagcgc cgctgccgcc gcagccgcca cctcagccgc 3660

tctctccgcc gccaagatca tggctattaa cccaagcctt tacagccctg tatccgtgga 3720

cacctcagcc ctggagcttt accggcgaga tctagctcaa gtggtggacc aactcgcagc 3780

cgtgagccaa cagttgcagc tggtgtcgac ccgagtggag caactttccc gccctcccca 3840 

gcaaccgcaa aaattcaata aacagaattt aataaacagc acttgagaaa agtttaaact 3900 

^9^g9ttgac tttattcctg gatagctggg gggagggaac ggcgggaacg gtaagacctg 3960 

gtccatcgtt cccggtcgtt gagaacacgg tggatttttt ccaagacccg atagaggtgg 4020 

gtctgaacgt tgagatacat gggcatgagc ccgtctcggg ggtggaggta ggcccactgc 4080 

A999<:c^cgt tttcaggggt ggtgttgtaa atgatccagt cgtaggcccc ccgctgggcg 4140 

*^99^9ctgga agatgtcctt cagcagcaag ctgatggcaa cgggaagacc cctggtgtag 4200 

gtgttgacaa agcggttgag ttgggagggg tgcatgcggg gactgatgag gtgcattttg 4260
gcctiggatci: tgaggttggc tatgttgccg cccagatcgc gcctgggatt catgttatgc 4320 
aagaccacca gcaccgagta accggtgcag cgggggaatt tgtcgtgcag cttggaaggg 4380 
aaagcgtgga agaatttgga gacccctcgg tgcccgccta ggttttccat gcactcatcc 4440 
atgatgatgg cgatgggccc ccgggaggca gcctgggcaa aaacgttgcg ggggtccgtg 4500 
acatcgtagt tgtggtcctg ggcgagttca tcataggaca ttttgacaaa gcgcgggcag 4560
agggtcccag actggggaat gatggttcca tccggtccgg gggcgtagtt gccctcgcag 4620

atttgcattt cccaggcttt gatttcagag ggagggatca tgtcaacctig gggggcgatg 4680 

aaaaaaatgg tctctggggc gggggtgatg agctgggtgg aaagcaggtt gcgcaagagc 4740 

tgtgacttgc cgcagccggt gggcccgtag atgacagcta tgacgggttg cagggtgtag 4800 

tttagagagc tacaactgcc atcatccttc aaaagcgggg ccacactgtt taaaagttct 4860 

ctaacatgta agttttcccg cactaagtcc tgcaggagac gtgaccctcc tagggagaga 4920 

agttcaggaa gcgaagcaaa gtttttaagt ggcttgaggc catcggccaa gggcaagttc 4980 

ctgagagttt gactgagcag ttccagccgg tcccagagct cggttacgtg ctctacggca 5040 

tctcgatcca gcagacctcc tcgtttcggg ggttggggcg gctctggctg tagggaatga 5100

ggcggtgggc gtccagctgg gccatggtgc ggtccctcca tgggcgcagg gttctcttca 5160 

Sggtggtctc ggtcacggtg aatgggtggg ccccgggctg ggcgctggcc agggtgcgct 5220 

tgaggctgag gcggctggtg gcg^accgtt gcttttcgtc tccctgcaag tcagceaaat 5280 

agcaacggac catgagctca tagtccaggc tctctgcggc atgtcctttg gcgcgaagct 5340 

tgcctttgga aacgtgcccg cagtttgagc agagcaagca ttttagcgcg tagagttttg 5400

gcgccaagaa cacggattcc ggggaataag catccccacc gcagttggag caaac^gttt 5460

cgcattccac cagccaggtc agctgag^at cttttgggtc aaaaaccaag cgcccgccgt 5520 

tttttttgat gcgcttccta cctcgggtct ccatgaggcg gtgcccgcgt tcggtgacga 5580 

agaggctgtc ggtgtctccg tagacggagg tcagggcgcg ctcctccagg ggggtcccgc 5640 

ggtcctcggc gtagagaaac tcgcaccact ctgacataaa cgcccgggtc caggctagga 5700 

cgaatgaggc gatgtgggaa gggtaccggt cgttatcgat gagggggtcg gttttttcca 5760

aggtgtgcag gcacatgtcc ccctcgtccg cttccaaaaa tgtgattggc ttgtaggtgt 5820 

aagtcacgtg atcctgtcct tccgcggggg tataaaaggg ggcgtttccc ccctcctcgt 5880
cactctcttc cggttcgctg tcgccaaagg ccagctgttg gggtacgtaa acgcgggtga 5940 
aggcgg'gcat gacctgtgcg ctgaggttgt cagtttci;at atacgaggaa gatttgatgg 6000 
cgagcgcccc cgtggagatg cccttgaggt gctcggggcc catttggtca gaaaacacaa 6060 
tctgtcggtt atcaagcttg gtggcaaaag acccgtagag ggcgttggag agcaacttgg 6120 
cgatggagcg ctgggtttgg tttttttccc ggtcggcttt ttccttggcc gcgatgttga 6180 
gctggacgta ctccctggcc acgcacttcc agccgggaaa aacggccgtg cgctc^tccg 6240 
gcaccagcct cacgctccat ccgcggttgt gcagggtgat gacgtcgatg ctggtggcca 6300

cctctccgcg caggggctcg ttggtcbagc agaggcgacc gcccttgcga gagcagaagg 6360 

ggggcagggg gtcaagcagg cgctcgtccg gggggtcggc gtcgatggta aagatggcgg 6420 

gcagcaggtg tttgtcaaag taatcgatct gatgcccggg gcaacgcagg gcggtttccc 6480 

agtcccgcac cgccaaggcg cgctcgtatg gactgagggg ggcgccccag ggcatgggat 6540 

gcgtcagggc cgaggcgtac atgccgcaga tgtcatagac gtaaaggggc tcctccagga 6600 

cgccgaggta ggtggggtag cagcgccccc cgcggatgct ggcccgtacg tagtcgtaga 6660 

gctcgtgcga gggggccaga aggtggcggc tgaggtgagc gcgctggggc ttttcatctc 6720 

ggaagaggat ctgcctgaag atggcgtggg agttggagga gatggtgggc cgctgaaaaa 6780 

tgttgaagcg ggcgtcgggc agacccacgg cctcgccgat aaagtgggcg taggactctt 6840 

gcagcttttc caccagggag gcggtgacca gcacgtccag agcgcagtag tccagggttt 6900 

cccgcacgat gtcataatgc tctjtcctttt tttccttcca gaggtctcgg ttgaagagat 6960 

actcttcgcg gtctttccag tactctt'gga gaggaaaccc gttt^cgtct ccacggtaag 7020 

agcccaacat gtaaaactgg ttgacggcct gatagggaca gcatcccttc tpcaogggca 7080 

gcgagtaggc cagggcggcc ttgcgcaggg aggtgtgagt cagggcaaag gtgtcgcgga 7140

ccataacttt tacaaactgg tac'ttaaagt cccggtcgtc gcacatgcct cgcdcccagt 7200 

ctgagtagtc tgtgcgcttt ttgtgcttgg ggttaggcag ggagtaggtg acgtcgttaa 7260 

agaggatttt gccacatctg ggcataaagt tgcgagagat tctgaagggg ccgggcacct 7320 

ccgagcggtt gttgatgact tgggcagcca ggagaatttc gtcgaagccg ttgatgttgt 7380 

gccccacgac gtagaactct atgaaacgcg gagcgccgcg cagcaggggg cacttttcaa 7440 

gttgctggaa agtaagttcc cgcggctcga cgccgtgttc cgtgcggctc cagtcctcca 7500 

ccgggtttcg ctccacaaaa tcctgccaga tgtggtcgac tagcaagagc tgcagtcggt 7560 

cgcgaaattc gcggaatttt ctgccgatgg cttgcttctg ggggttcaag caaaaaaagg 7620 

tgtctgcgtg gtcgcgccag gcgtcccagc cgagctcgcg agccagattc agggccagca 7680 

gcaccagagc cggctcaccg gtgattttca tgacgaggag aaagggcacc agctgttttc 7740 

cgaacgcgcc catccaggtg taggtctcca cgtcgtaggt gagaaacaga cgttcggtcc 7800 

gcgggtgcga tcccaggggg aaaaacttga tgggctgcca ccattgggag ctctgggcgt 7860 

ggatgtgatg gaagtaaaag tcccggcggc gcgtggaaca ttcgcgctgg tttttgtaaa 7920 

agcggccgca gtggtcgcag cgcgagacgg agtgaaggct gtgaatcagg tgaatdttgc 7980 
Stcgctgagg gggccccaga gccaaaaagc ggagcgggaa cgaccgcgcg gccacttcgg 8040

cgtccgcagg caagatggat gagggttcca ccgttccccg cccgcggacc gaccagactt 8100 

ccgccagctg cggcttcagt tcttgcacca gctctcgcag cgtttcgtcg ctgggcgaat 8160 

cgtgaatacg gaagttgtcg ggtagaggcg ggaggcggtg gacttccagg aggtgtgtga 8220

gggccggcag gagatgcagg tggtacttga tttcccacgg atgacggtcg cgggcgtcca 8280 

aggcgaagag atgaccgtgg ggccgcggcg ccaccagcgt tccgcggggg gtq'tttatcg 8340 

gcggcgggga cgggctcccg gcggcagcgg cggctcggga cccgcgggca agtcgggcag 8400 

cggcacgtcg gcgtggagct cgggcagggg ctggtgctgc gcgcggagct gactggcaaa 8460 

ggctatcacc cggcgattga cgtcctggat ccggcggcgc tgcgtgaaga ccaccggacc 8520 

cgtggtcttg aacctgaaag agagttcgac agaatcaatc tcggcatcgt taaccgcggc 8580 

ctggcgcagg atttcggcca cgtccccgga gttgtcttga tacgcgattt ctgccatgaa 8640 

ctggtcgatt tcctcttcct gcaagtctcc gtgaccggcg cgttcgacgg tggccgcgag 8700 

atcgttggag atgcggccca tgagctggga aaaggcattg atgccgacct cgttccacac 8760 

tcggctgtac accacctctc cgtgaacgtc gcgggcgcgc atcaccacct gggcgagatt 8820 

gagttccacg tggcgggcga aaacqggata gtttcggagg cgctgataca gatagttgag 8880 

ggtggtggcg gcgtgctcgg ccacaaaaaa atacatgatc cagcggcgga gggtcagccc 8940 

gttgatgtcg cccagcgcct ccaggcgttc catggcctcg taaaagtcca cggcaaagtt 9000 

gaaaaattgg ctgttcctgg ccgagaccgt gagctcttct tccaagagcc gaatgagatc 9060 

cgccacggtg gccctgactt cgcgttcgaa agccccgggt gcctcctcca cctcttcctc 9120 

ctcgacttcc tcgaccgctt cgggcacctc ctcttcctcg accaccacct caggcggggc 9180 

tcggcggcgc cggcggcgga cgggcaggcg gtcgacgaaa cgctcgatca tttcccccct 9240 

ccgtcgacgc atggtctcgg tgacggcgcg accctgttcg cgaggacgca gggtgaaggc 9300 

gccgccgccg agcggaggta acagggagat cggggggcgg tcgtggggga gactgacggc 9360 
gctaactatg catctgatca atgtttgcgt agtgacctcg ggtcggagcg agctcagcgc 9420

ttgaaaatcc acgggatcgg aaaacegttc caggaacgcg tctagccaat cacagtcgca 9480 

aggtaagctg aggaccgtct cgggggcttg tctgttctgt cttcccgcgg tggtgctgct 9540 

gatgaggtag ttgaagtagg cgctcttgag gcggeggatg gtggacagga gaaccacgtc 9600 

tttgcgccca gcttgctgta tccgcaggcg gtcggccatg ccccacactt ctccfctgaca 9660 
gcggcggagg tccttgtagt attcttgcat cagcctttcc acgggcaccc cgtcttcttc 9720

ttccgctcgg ccggacgaga gccgcgtcag gccgtacccg cgctgcccct gtggttggag 9780 

cagggccagg tcggccacga cgcgctcggc cagcacggcc tgctggatgc gggtgagggt 9840

gtcctgaaag tcgtcgagat ccacaaagcg gtggtacgcg ceagtgttga tggbgtaggt 9900

goagttgctc atgacggacc agtttacggt ctgggtgcca tggeccacgg tttccaggta 9960 

gcggagacgc gagtaggccc gcgtcccgaa gatgtagtcg ttgcaggtcc gcagc^ggca 10020 

ctggtagccc accagcagat gcggcggcgg ctggcggtag aggggccacc gctgggtggc 10080 

gggggcgttg ggggcgagat cttccaacat gaggcggcga tagccgtaga tgtagcgcga 10140

catccaagtg atgocgctgg ccgtggtgct ggcgcgggcg tagtcgcgaa cgcggttcca 10200 

gatgtttqgc agcggctgga agtactcgat ggtggggcga ctctgccccg tgaggcgggc 10260 

gcagtcggcg atgctctacg gggaaaaaga agggccagtg aacaaccgcc ttccgtagcc 10320 

ggaggagaac gcaagggggt caaagaccac ogaggctcgg gttcgaaacc cgggtggcgg 10380 

cccgaatacg gagggcggtt ttttgctttt ttctcagatg catccegtgc tgcggcagat 10440 

gcgtccgaac gcggggtccc agtccccggc ggtgcctgcg gccgtgacgg cggcttctac 10500 

ggccacgeeg cgctccaeco cgcctaceac ggcccaggcg gcggtggctc tgcgcggcgc 10560

aggggaaocc gaagc^gagg cggtgttgga cgtggaggag ggccaggggt tggctcggct 10620 

gggggccctg agtcccgagc ggcacccgcg cgtggctctg aagcgcgacg cggcggaggc 10680 

gtacgtgccg cggagcaatc tgtttegcga ccgcagcggc gaggaggccg aggagatgcg 10740 

agacttgcgt tttcgggcgg ggagggagct gcgtcacggg ctggaccggc agagggttct 10800

gagagaggag gactttgagg cggacgagcg cacgggggtg agtcccgcgc gggctcacgt 10860 

ggcggccgcc aacctggtga gcgcgtacga gcagacggtc aaggaggaga tgaacttcca 10920 

gaagagctto aatcatcacg tgc^cacgct gattgcgcgc gaagaggtgg ccatcpgcct 10980 

catgcatctg tgggattttg tggaggcgta cgttcagaac cceagcagca agccgctgae 11040 

ggctcagctg ttcctcatcg tgcaacatag tcgagacaac gaaacgttca ggga^ccat 11100

gctgaacatt gcagagcctg aggggegctg gctcttggat ctcattaaca tcttgckgag 11160

tatcgtagtg caggagcgct cgctgagcct ggccgacaag gtggctgcca tcaacbacag 11220 

catgctgtcg ctgggcaaat tttacgcccg caagatctac aagtctccgc tcgtccecat 11280 

agaeaaggag gtgaagatag acagctttta catgcgcatg gcgcccaagg tgctgactct 11340 

aagcgacgac ctgggggtgt aecgcaaega ccgcatacac aaggcggtga gcgccagccg 11400 
ccggcgcgag ctgagcgacc gcgagctttt gcacagcctg catcgggcgt tgactggtgc 11460

cggcagcgcc gaggcggccg agtactttga cgccggagcg gacttgcgct ggcagccatc 11520

ccgacgcgcg ctggaggcgg ctggcgtcgg ggagtacggg gtcgaggacg acgatgaagc 11580 

ggacgacgag ttgggcattg acttgtagcc gtttttcgtt agatatgtcg gogaacgagc 11640 

cgtctgccrgc cgccatggtg acggcggcgg gcgcgcccca ggacccggcc acgcgcgcgg 11700 

cgctgcagag tcagccttcc ggagtgacgc ccgcggacga ctggtccgag gccatgcgtc 11760 

gcatcctggc gctgacggcg cgcaaccccg aggctttitcg gcagcagccg caggcaaacc 11820 

ggtttgcggc cattttggaa gcggtggtgc cctccagacc caaccccacc cacgaaaagg 11880 

tgctggccat cgtcaacgcc ctggcggaga ccaaggccat ccgcccagac gaggccgggc 11940

aggtttacaa cgcgctgcta gaaagggtgg gacgctacaa cagctccaac gtgcagacca 12000 

atctggaccg cttggtgacg gacgtgaagg aggccgtagc ccagcgagag cggtttttca 12060 

aggaagccaa tctgggctcg ctggtggccc tcaacgcctt cctgagcacg ctgccggcga 12120 

acgtgccccg cggtcaggag gactacgtga actttctgag cgccctccgc ctgatggtgg 12180 

ccgaggtgcc gcagagcgag gtgeaccagt ctggccccaa ctactacttc cagacct'ccc 12240 

ggcagggcct gcagacggta aacccgacgc aggcctttca gaacctgcag ggcctttggg 12300 

gggtgcgcgc tccgctgggc gaccgcagca cggtgtccag cctgctgacc cccaatgccc 12360 

ggctgctctt gcttctcatt gctccgttca ccgacagcgg ttccatcagc cgcgactctt 12420 

acctgggaca cctgctcacc ctgtaccggg aggccatcgg gcaggcgcgg gtggacgagc 12480 

agacgtacca ggaaatcacc agcgtgagcc gcgcgctggg gcaggaggac acgggcagct 12540 

tggaggcgac tctgaacttc ctgctgacca accggcggca gcgcctacct ccccagtacg 12600 

cgctgaacgc ggaggaggag cgcatcctgc gtttcgtgca gcagagcacc gcgctgtact 12660 

tgatgcggga aggcgcctct cccagcgctt cgctggacat gacggcggcc aacatggagc 12720

catcgttcta cgccgccaac cgtcccttcg tcaaccggct aatggactat ttgcatcggg 12780 

cggcggccct gaacccggaa tactttacta acgtcatcct gaacgaccgt tggctgccac 12840 

ctcccggctt ctacacgggg gagttcgacc tcccggaggc caacgacggt ttcatgtggg 12900 

acgacgtgga cagcgtgttc ctgcccggca agaaggaggc gggtgactct cagagccacc 12960 

gcgcgagcct cgcagacctg ggggcgaccg ggcccgcgtc tccgctgcct cgcctgccga 13020 

gcgccagcag cgccagcgtg gggcgggtga gccgtccgcg cctcagcggt gaggaggact 13080 
ggtggaacga tccgctgctc cgtccggccc gcaacaaaaa cttccccaac aacgggatag 13140
aggatttggt agacaaaatg aaccgttgga agacgtatgc ccaggagcat cgggagtggc 13200
aggcgaggca acccatgggc cctgttctgc cgccctctcg gcgcccgcgc agggacgaag 13260 
acgccgacga ttcagccgat gacagcagcg tgttggatct gggcgggagc gggaacccct 13320
ttgcccacct gcaacctcgc ggcgtgggbc ggcggtggcg ctaggaaaaa aaattattaa 13380 
aagcacttac cagagccatg gtaagaagag caacaaaggt gtgtcctgct ttcttcccgg 13440 
tagcaaaatg cgtcgggcgg tggcagttcc ctccgcggca atggcgttag gcccgccccc 13500 
ttcttacgaa agcgtgatgg cagcggccac cctgcaagcg ccgttggaga atccttacgt 13560 
gccgccgcga tacctggagc ctacgggcgg gagaaacagc attcgttact cggagctgac 13620 
gcccctgtac gacaccaccc gcctgtacct ggtggacaac aagtcagcag atatcgccac 13680 
cttgaactac cagaacgacc acagcaactt tctcacgtcc gtggtgcaga acagcgacta 13740 
cacgcccgcc gaagcgagca cgcagaccat taacttggac gaccgctcgc gctggggcgg 13800 
ggacttgaaa accatcctgc acactaacat: gcccaacgtg aacgagttca tgtutaccaa 13860 
ctcgttcagg gctaaactta cggtggcgca cgaggccgac aaggacccgg tttatcfagtg 13920 
ggtgcagctg acgctgccgg aggggaactt ttcagagatt acgaccatag acctgatgaa 13980 
caacgccatt atcgaccact acctggcggt agccagacag cagggggtga aagaaagcga 14040
gatcggcgtc aagtttgaca cgcgcaactt tcgtctgggc tgggacccgg agacggggct 14100
tgtgatgccg ggggtgtaca cgaacgaagc tttccatccc gacgtggtcc tcttgccggg 14160 
ctigcggggtg gactttacct acagccggtt aaacaacctg ctaggcatac gcaagagaat 14220 
gccctttcag gaagggtttc agatcctgta cgaggacctg gagggcggta acatcccggc 14280 
cctgctggac gtgccggcgt acgaggagag catcgccaac gcaagggagg cggcgatcag 14340 
gggcgataat ttcgcggcgc agccccaggc ggctccaacc ataaaacccg ttttggaaga 14400 
ctccaaaggg cggagctaca acgtaatagc caacaccaac aacacggctt acaggagctg 14460 
gtatctggct tataactacg gcgacccgga gaagggggtt agggcctgga ccctgctcac 14520
cactccggac gtgacgtgcg gttcagagca ggtctactgg tcgctgcctg acatgtacgt 14580 
ggaccdtgtg acgtttcgct ccacg^agca agttagcaac tacccagtgg tgggagcgga 14640 
gcttatgccg attcacagca agagctttta caacgagcag gccgtctact cacagctcat 14700
tcgtcagacc accgccckaa cgcacgttte caaccgcttc cccgagaacc aaatcctagt 14760 
gcgacctcca gcgcccacca tcaccaccgt cagcgagaac gtgcccgctc taaccgatca 14820 
cgggacgctg cctttgcaga acagcatccg cggagttcag cgagttacca tgacggacgc 14880

ccgtcgtcgg acctgtccct acgtctacaa agccttggga atcgtggccc cgcgcgtcct 14940

gtcgagtcgc actttctaga tgtccatcct catctctccc agcaacaata ccggttgggg 15000

tctgggcgtg accaaaatgt acggaggcgc caaacgacgg tcdccacaac atticegtgcg 15060 

agtgcgcggg cactttagag ccccatgggg gtcgcacacg cgcgggcgca ccggccgaac 15120 

caccgtcgac gacgtgatcg atagcgtggt ggccgacgcc cgcaactacc agcccgctcg 15180 

atccacggtg gacgaagtca tcg^cggcgt ggtggccgac gccagggcct acgcccgcag 15240 

aaagtctcgt ctgcgccgcc gccgttcgct aaagcgcccc acggccgcca tgaaagccgc 15300 

tcgctctctg ctgcgtcgcg cacgtatcgt gggtcgccgc gccgccagac gcgcagccgc 15360 

caacgccgcc gccggccgag tgcgccgccg ggccgcccag caggccgccg cJcgccatctc 15420 

cagtctatcc gccccccgac gcgggaatgt gtactgggtc agggactcgg ccaccggcgt 15480 

gcgagttccc gtgagaaccc gtcctcctcg tccctgaata aaaagttcta agcccaatcg 15540 

gtgttccgtt gtgtgttcag ctcgtcatga ccaaacgcaa gtttaaagag gagctgctgc 15600 

aagcgctggt ccccgaaatc tatgcgccgg cgccggacgt gaaaccgcgt cgcgtgaaac 15660 

gcgtgaagaa gcaggaaaag ctagagacaa aagaggaggc ggtggcgttg ggagacgggg 15720

aggtggagtt tgtgcgctcg ttcgdgccgc gtcggcgagt gaattggaag gcgcgcaagg 15780

tgcaacgggt gctgcgtccc ggcacggtgg tgtctttcac cccgggtgaa aaatccgcct 15840

ggaagggc^t aaagcgcgtg tacgatigagg tgtacgggga cgaagacact ctggagcagg 15900 

cgctggatag aagcggggag tttgcttacg gcaagagggc gaggacgggc gagatcgcca 15960 

tccqgctgga cacttccaac cccaccccca gtctgaaacc cgtgacgctg caacaggtgt 16020 

tgccggtgag cgccccctcg cgacgcggca taaaacgcga gggcggcgag ctgcagccca 16080 

ccatgcagct cctggttccc aagaggcaga aactagagga cgtactggac atgataaaaa 16140 

tggagcccga cgtgcagccc gatattaaaa tccgtcccat caaagaagtg gcgccgggaa 16200 

tgggcgtgca gaccgtggac atccagattc ccatgaccag cgccgcacag gcggtagagg 16260

ccatgcagac cgacgtgggg atgatgacgg acctgcccgc agctgctgcc gccgtggcca 16320 

gcgccgcgac gcaaacggaa gccggcatgc agaccgaccc gtggacggag gcgcccgtgc 16380 

agccggccag aagacgcgtc agacggacgt acggccccgt ttctggcata atgccggagt 16440 

acgcgccgca tccttccatc atccccaccc ccggctaccg ggggcgcacc taccgtccgc 16500 
gacgcagcac cactcgccgc cgtcgccgca cggcacgagt cgccaccgcc agagtgagac 16560
gcgtaacgac acgtcgcggc cgccgcttga ccctgcccgt ggtgcgctac catcccagca 16620 
ttcttcaaaa aaccgctcct acgttgcaga tgggcaagct tacttgtcga ctccgtatgg 16680 
ccgtgcccgg ctaccgagga agatcccgcc gacgacggac tttgggaggc agcggtttgc 16740
gccgccgtcg gg'cggtitcac cggcgcctca agggaggcat tctgccggcc ctgatcccca 16800 
taatcgccgc agccatcggg gccattcccg gaatcgccag cgtagcggtg caggctagcc 16860 
agcgccactg attttactaa ccctgtcggt cgcgccgtct ctttcggcag actcaacgcc 16920 
cagcatggaa gacatcaatt tctcctctct ggccccgcgg cacggcacgc ggccgtatiat 16980
ggggacgtgg agcgagatcg gcacgaacca gatgaacggg ggcgctttca attggagcgg 17040
tgtgtggagc ggcttgaaaa atttcggttc cactctgaaa acttacggca accgggtgtg 17100 
gaactccagc acggggcaga tgdtc^aggga caagctaaag gacacgcagt ttcagcaaaa 17160 
ggtggtggac ggcatcgctt cgggcctcaa cggcgccgtc gacctggcca accaggccat 17220 
tcaaaaggaa attaacagcc gcctggagcc gcggccgcag gtggaggaga acctgccccc 17280 
tctggaggcg ctgcccccca agggagagaa gcgcccgcgg cccgacatgg aggagacgct 17340 
agttactaag agcgaggagc cgccatcata cgaggaggcg gtgggtagct cgcagctgcc 17400 
gtccctcacg ctgaagccca ccacctatcc catgaccaag cccatcgcct ctatggcgcg 17460 
ccccgtggga gtcgacccgc ccatcgacgc ggtggccact ttggacctgc cgcgccccga 17520
acccggcaac cgcgtgcctc ccgtccccat cgctccgccg gtttctcgcc ccgccatccg 17580 
ccccgtcgcc gtggccactc ccc^ctatcc gagccgcaac gccaactggc agaccaccct 17640
caacagtatt gtcggactgg gggtgaagtc tctgaagcgc cgtcgctgtt tttaaagcac 17700 
aatttattaa acgagtagcc ccgtcttaat ccatcgttgt atgtgtgcct atatcacgcg 17760
ttcagagcct gaccgtccgt caagatggcc actccgtcga tgatgccgca gtggtcgtac 17820 
atgcacatcg ccgggcagga cgcctcggag tacctgagcc cgggtctggt gcagtttgcc 17880 
cgtgcgacgg aaacctactt ctcactgggc aacaagttca ggaaccccac cgtggcgccc 17940
acccacgacg tcaccaccga tcggtcccag cgactgacaa tccgcttcgt ccccgtggac 18000
aaggaagaca ccgcttactc ctae:'aaaacc cgcttcacgc tggccgtggg cgacaaccgg 18060 
gtgctagaca tggccagtac ctactttgac atccgcggcg tgatcgaccg cggacctagc 18120 
ttcaagcctt actccggcac ggcttacaac tcactggctc ccaaaggggc gcccaacaac 18180 
agccaatgga acgccacaga taacgggaac aagccagtgt gttttgctca ggcagctttt 18240
ataggtcaaa gcattacaaa agacggagtg caaatacaga actcagaaaa tcaacaggct 18300

gctgccgaca aaacttacca accagagcct caaattggag tttccacctg ggataccaac 18360 

gttaccagta acgctgccgg acgagtgtta aaagccacca ctcccatgct gccatgttac 18420

ggttcatatg ccaatcccac taatccaaac gggggtcagg caaaaacaga aggagacatt 18480

tcgctaaact ttttcacaac aactgcggca gcagacaata atcccaaagt ggtftctttac 18540 

agcgaagatg taaaccttca agdccccgat actcacttag tatataagcc aacggtggga 18600 

gaaaacgtta tcgccgcaga agccctgcta acgcagcagg cgtgtcccaa cagagcaaac 18660 

tacataggtt tccgagataa ctttatcggt ttaatgtatt ataacagcac agggaacatg 18720 

ggagttctgg caggtcaggc ctcgcagtta aacgcagttg tagacctgca agatcgaaac 18780 

acggaactgt cctatcagct aatgctagat gctctgggtg acagaactcg atatttctca 18840 

atgtggaatc aggccgtgga cagctacgat ccagacgtta ggattatcga gaaccatggg 18900 

gtggaagacg agctgcccaa ttactgtttt ccactcccag gcatgggtat ttttaactcc 18960 

tacaaggggg taaaaccaca aaatggcggt aatggtaact gggaagcaaa cggggaccta 19020 

tcaaatgcca atgagatcgc tttaggaaac atttttgcca tggaaattaa cctccacgca 19080 

aacctgtggc gcagcttctt gtacagcaat gtggcgctgt acctgccaga cagctataaa 19140 

ttcactcccg ctaacatcac tctgcccgcc aaccaaaaca cctacgagta tatcaacggg 19200 

cgcgtcactt ctccaacccc ggtggacacc tttgttaaca ttggagcccg atggtcgccg 19260 

gatcccatgg acaacgtcaa cccetttaac catcaccgga acgcgggccc ccgttaccgc 19320 

tccatgctgc tgggaaatgg acgcgtggtg ccttitccaca tacaagtgcc gcaaciaattt 19380 

ttcgcgatta agaacctcct gcttttgccc ggctcctaca cttacgagtg gagcttcaga 19440 

aaagacgtga acatgattct gcagagcacc ctgggcaatg atpttcgagt ggacggggcc 19500 

agcgtccgca ttgacagcgt caacttgtac gccaactttt tccccatggc gcacaacacc 19560 

gcttctacct tggaagccat gctgcgaaac gacaccaacg accagtcgtt taacgactac 19620 

ctcagcgcgg ccaacatgct ttaticccatt ccggccaacg ccaccaacgt tcccatttcc 19680 

attccctccc gcaactgggc ggccttccgg ggatggagct tcacccgcct taaagccaag 19740 

gaaacgcctt ccttgggctc cggctttgac ccctactttg tgtactcagg caccattcct 19800 

tacctggacg gcagctttta cctpaaccac actttcaaac gtctgtccat catgttcgat 19860 

tcttccgtaa gttggccggg caacgaccgc ctcctgacgc cgaacgagtt cgaaattaag 19920 
cgcattgtgg acggggaagg ctacaacgtg gctcaaagta acatgaccaa agactggttt 19980

ttaattcaaa tgctcagcca ctacaacatc ggctaccaag gcttctatgt tcccgagggc 20040 

tacaaggatc ggatgtattc tttcttccga aactttcagc ccatgagccg ccaggtgccg 20100 

gatcccaccg ctgccggcta tcaagccgtt cccctgccca gacaacacaa caactcgggc 20160 

tttgtggggt acatgggccc gaccatgcgc gaaggacagc catacccggc caactacccc 20220 

tatcccctga tcggcgctac cgccgtcccc gccattaccc agaaaaagtt tttgtgcgac 20280 

cgcgtcatgt ggcgcatacc tttttccagc aactttatgt caatgggggc cctgaccgac 20340 

ctcggacaga acatgcttta cgctaactcc gcccatgccc tggatatgac ttttgaggtg 20400 

gaccccatga acgagcccac gttgctgtac atgctttttg aggt^ttcga cgtggjtcaga 20460 

gtgcaccagc cgcaccgcgg tattatcgag gccgtgtacc tgcgcacccc cttctctgcg 20520

ggcaatgcca ccacataagc cgctgaacta gctggttttt accccagatc ccatgggctc 20580 

cacggaagac gaactgcggg ccattgtgcg agacctgggc tgcggaccct acttcctggg 20640 

cacctttgac aagcggtttc ccgggttcgt gtctcctcgc aaactcgcgt gcgcgatcgt 20700 

gaataccgcc ggccgagaga ccggaggaga gcattggcta gctdtgggct ggaacccccg 20760 

ctcgtccacg tttttcctgt tcgacccctt tggcttttca gaccaacgct tgaagcagat 20820 

ctatgcattt gaatatgagg gtctactcaa gcgaagcgcg ctggcctcct ccgccgatca 20880

ctgtctaacc ctggtaaaga gcactcagac ggttcagggc cctcacagcg ccgcctgtgg 20940 

ccttttttgt tgcatgtttt tgcacgcctt tgtgaactgg ccggacaccc ccatggaaaa 21000 

caaccccacc atggacctcc tgactggcgt tcccaactcc atgctccaaa gccccagcgt 21060 

gcagaccacc ctcctccaaa accagaaaaa tctgtacgcc tttctgcaca agcactctcc 21120 

ctactttcgc cgccatcggg aacaaataga aaatgcaacc gcgtttaaca aa&ctctgta 21180 

acgtttaata aatgaacttt ttattgaact ggaaaacggg tttgtgattt tfcaaaaatca 21240 

aaggggttga gctggacatc catgtgggag gccggaaggg tggtgttctt gtact'ggtac 21300 

ttgggcagcc acttaaactc tggaatcaca aacttgggca gcggtatttc tgggaagttg 21360 

tcgtgccaca gctggcgggt cagctgaagt gcctgcagaa catcgggggc ggagatcttg 21420 

aagtcgcagt ttatctggtt cacggcacgc gcgttgcggt acatgggatt ggcacactga 21480 

aacaccagca ggctgggatt cttgatgcta gccagggcca cggcgtcggt cacgtcaccg 21540 

gtgtcttcta tgttggacag cgaaaaaggc gtgactttgc aaagctggcg tcecgcgcga 21600 

ggcacgcaat ctcccaggta gttgcactca cagcggatgg gcagaagaag atgcttgtgg 21660 
eegegggeca tgtagggata ggccgctgcc ataaaagctt cgatctgcct gaaagcctgc 21720

ttggccttgt gcccttcggt ataaaaaaca ccgcaggact tgttggaaaa ggtattactg 21780

gcgcaagcgg catcgtgaaa geaagcgcgt gcgtcttcgt ttcgtaactg caccacgctg 21840

cggccccacc ggttctgaat caccttggcc ctgccggggt tttccttgag agcgcgctgg 21900 

ccggcttcgc tgcccacatc catttccacg acatgctcct tgttaatcat ggccagaccg 21960

tggaggcagc gcagctcctc gtcatcgtcg gtgcagtgat gctcccacac gacgcagcca 22020 

gtgggctccc acttgggctt ggaggcctcg gcaatgccag aatacaggag aab^tagtgg 22080 

tgcagaaaac gtcccatcat ggtgccaaag gttttctggc tgctgaaggt catcgggcag 22140

tacctccagt cctcgttaag ccaagtgttg cagatcttcc tgaagaecgt gtactgatcg 22200 

ggcataaagt ggaactcatt gcgctcggtc ttgtcgatct tatacttttc catcagacta 22260 

tgcataatct ccatgccctt ttcccaggcg caaacaatct tggtgotaca cgggitaggt 22320 

atggccaaag tggttggcct ctgaggcggc gcttgttctt cctcttgagc cctctoccga 22380 

ctgacggggg ttgaaagagg gtgccccttg gggaacggct tgaacacggt otggcccgag 22440 

gcgtcccgaa gaatotgcat cgggggattg etggccgtca tggcgatgat ctgaccccgg 22500 

ggctcetcca cttcgtcctc ctcgggactf tcctcgtgct tttcggggga cggtadggga 22560 

gtagggggaa gagcgcggcg cgccttcttc ttgggcggca gttccggagc ctgctcttga 22620 

egactggcca ttgtcttctc ctaggcaaga aaaacaagat ggaagactct ctctcctcct 22680 

cctcgtcaac gtcagaaagc gagtcttcca ccttaagcgc cgagaactcc cagcgcatag 22740 

aatcogatgt gggctacgag actccccccg cgaacttttc gccgcccccc ataaacacta 22800 

acgggtggac ggactacctg gccctaggag acgtactgct gaagcacatc aggcggcaga 22860 

gegttatcgt gcaagatgct ctcaccgago gactcgcggt tccgctggaa gtggcggaac 22920 

ttagcgccgc ctacgagcga accctcttct ccccaaagac tccccccaag aggcaggcta 22980 

acggcacctg cgagcctaac cctcgactca acctctaccc tgcctttgcc gtgccagagg 23040 

tactggctac gtaccacatt tttttccaaa accacaaaat ccctctctcg tgcc^cgcca 23100

accgoaccaa agccgatcgc gtgctgcgac tggaggaagg ggcccgcata cctgagattg 23160

cgtgtctgga ggaagtccca aaaatctttg aaggCctggg ccgcgacgaa aagV:gagcag 23220 

caaacgctct ggaagagaae gcagagagte acaaeagcge cttggbagaa ctcgagggcg 23280 

acaacgccag actggccgtc ctcaaacggt ccatagaagt caegcacttc gcctaccccg 23340 
ccgttaacct ccctccaaaa gttatgteag cggtcatgga ctcgctigctc ataaagcgcg 23400
ctcagccctt agacccagag cacgaaaaca acagtgacga aggaaaaccg gtggtttctg 23460 
atgaggagtt gagcaagtgg ctgticctcca acgaccccgc cacgttggag gaacgaagaa 23520 
aaaccatgat ggccgtggtg ctagttaccg tgcaattaga atgtctgcag aggttctttt 23580
cccacccaga gaccctgaga aaagtggagg aaacgctgca ctacacattt aggcacggct 23640 
acgtgaagca agcctgcaag atttccaacg tagaacttag caacctcatc tcctacctgg 23700 
ggatcttgca cgaaaaccgc ctcggacaaa acgtgctgca cagcacactg aaaggagaag 23760 
cccgccgaga ctatgtgcga gactgcgtgt tcctagcgct agtgtacacc tggcagagcg 23820 
gaatgggagt ctggcagcag tgcctggagg acgaaaacct caaagagctt gaaaagctgc 23880
tggtgcgctc cagaagggca ctgtggacca gtttitgacga gcgcaccgcc gcgcgagacc 23940 
nagctgatat tattttticct cccaagctgg tgcagactct ccgggaagga ctgccagatt 24000 
ttatgagtca aagcat:ct.tg caaaacttcc gctctctcat cttggaacgc tcgggaatict: 24060 
tgcccgccac tagctgcgcc ctacccacag attttgtgcc tctccactac cgcgaatgcc 24120
caccgccgct gtggccgtac acttfacttgc ttaaactggc caactttcta £^tgt:tccact 24180 
ctgacctggc agaagacgtt agcggcgagg ggctgctaga atgccactgc cgctgcaacc 24240 
tgtgcaccGC ccaccgctct: ctagtatgca acactcccct gctcaatgag acccagatca 24300 
tcggtacctt tgaaatccag ggaccctccg acgcggaaaa cggcaagcag g.ggtctgggc 24360 
taaaactcac agccggactg tggacctccg cctacttgcg caaatttgta ccagaagact 24420 
atcacgccca ccaaattaaa ttttacgaaa accaatcaaa accacccaaa agcgagttaa 24480 
cggcttgcgt cattacgcag agcagcatag ttgggcagtt gcaagccatt aacaaagcgc 24540 
ggcaagagtt tctcctiaaaa aaaggaaaag gggtctactt ggacccccag accggcgagg 24600 
aactcaacgg accctcctca gt:cgcaggtt. gtgtgcccca tgccgcccaa aaagaacacc 24660 
tcgcagtgga acatgccaga gacgpaggaa gaggagtgga gcagtgtgag caacagcgaa 24720 
acggaggaag ^gccgtggcc cgaggggtgc aacggggaag aggacacgga gggacggcga 24780 
agtcttcgcc gaagaactct cgccgctgcc cccgaagtcc cagccggccg ccucggccca 24840 
agatcccgca cacacccgta gatgggatag caagaccaaa aagccgggta agagaaacgc 24900 
tcgcccccgc cagggctacc gct^cgtiggag aaagcacaaa aactgcatct tatcgtgctt 24960 
gctccagtgc ggcggagacg tttcgttcac ccgtagatac ttgcttttta acaaaggggt 25020 
ggccgtcccc cgtaacgtcc tccac^acta ccgtcactct tacagctccg aagcggacgg 25080 
ctaagaaaac gcagcagttg ccggcgggag gactgcgtct cagcgcccga gaacccccag 25140

ccaccaggga gctccgaaac cgcatatttc ccaccctcta cgctatcttt cagcaaagcc 25200 

gggggcagca gcaagaactg aaaataaaaa accgcacgct gaggtcgctt acccgaagct 25260

gcctctatca caagagcgaa gagcagctgc agcgaaccct ggaggacgca gaagcgctgt 25320

tgcagaagta ctgcgcgacc accctaaata actaaaaaag cccgcgcgcg ggacttcaaa 25380

ccgtctgacg tcaccagccg cgcgccaaaa tgagcaaaga gattcccacg ccttacatgt 25440 

ggagttacca gccgcagatg ggattagccg ccggcgccgc ccaggattac tccacgaaaa 25500 

tgaactggct cagcgccggg ccccacatga tttcccgcgt aaacgacatt cgcgcccacc 25560 

gcaatcagct attgttagaa caggctgctc tgaccgccac gccccgtaat aacctgaacc 25620 

ctcccagctg gccagctgcc ctggtgtacc aggaaacgcc tccacccacc agcgtacttt 25680 

tgccccgtga cgcccaggcg gaagtccaga tgactaacgc gggcgcgcaa ttagcgggcg 25740 

gatcccggtt tcggtacaga gttcacggcg ccgcacccta tagcccaggt ataaagaggc 25800 

tgatcattcg aggcagaggt gtccagetca acgacgagac agtgagctct tcgcttggtc 25860 

tacgaccaga cggagtgttc cagctcgcgg gctcgggccg ctcttcgttc acgcQtcgcc 25920 

aggcatacct gactctgcag agctctgcct ctcagcctcg ctcgggagga atcggacccc 25980 

ttcagtttgt ggaggagttt gtgccctcgg tctackttca gcctttctcc ggaecgcccg 26040

gccagtaccc ggacgagttc atccceaact tcgacgcggt gagtgactct gtggacggtt 26100 

atgactgatg tcgagcccgc ttcagtgcta gtggaacaag cgcggctcaa tcacctggtt 26160 

cgttgccgcc gccgctgctg cgtggbtcgc gacttgagct tagctctcaa gtttgtaaaa 26220 

aacccgtccg aaaccgggag cgctgtgcac gggttggagc tagtgggtcc tgagaaggcc 26280 

accatccacg ttctcagaaa ctttgtggaa aaacccattt tggctaaacg agaccagggg 26340 

cctettgtaa tcagcttact ctgcacctgt aaccatgttg accttcacga ctattttatg 26400 

gatcatttgt gcgctgaatt caataagtaa agcgaattct taccaagatt atgatgtcca 26460

tgactgtccc tcgccactat acgatgttgt gccagtaaac tctcttgtcg acatctatct 26520

gaactgttcc ttttggtccg cacagcttac ttggtactac ggtgacaccg tcctttctgg 26580 

ctcactgggc agctcacacg gaataacact tcacctcttt tcgccgtttc gatacggaaa 26640 

ctacagctgt cgtgccggta cctgcctcca cgttttcaat cttcagccct gtccaccgac 26700 
caaacttgta tttgtcgact ctaagcactt acagctcaac tgcagcattc taggccccag 26760 
tatcttgtgg acatacaata aaatcaggtt ggtggaattt gtctactacc cacccagcgc 26820

ccgcggtttt ggggaaattc ctttccagat ctactacaac tatcttgcca cacattatgc 26880

aagtcaacag caactaaact tgcaagcacc cttcacgcca ggagagtact cctgtcacgt 26940

aggctcctgc acagaaactt ttattctctt caacagatct tctgccattg aacgcttcac 27000

tactaactac tttagaaacc aagttgtgct tttcactgac gaaaccccta acgtcaccct 27060

ggactgtgca tgtttttctc atgacaccgt aacttggact cttaacaata ctctctggct 27120

cgcgttcgat aaccaaagct tgattgttaa aaattttgat ttaaccttta ctaaaccctc 27180

tcctcgcgaa atagttiatct titgctccttt taatccaaaa actaccttag cctgtcaggt 27240

tttgtttaag ccttgccaaa caaactttaa gtttgtttati ttgcctccgc aatctgtcaa 27300

actcatagaa aaatacaaca aagcgcccgt cttggctcct aaaaccttct accactggct 27360

aacctacacg gggctgtttg cactaactgt ttttttccta attaacattt ttat^tgttt 27420

cttgccttcc tccttctttt cgcgaacacc gttgccgcag aaagacctct ccttattiact 27480

gtagcgcttg ctatacaaaa ccaagagtgg tcaaccgtgc tctcaatcta ttttcaattt 27540

ttcattttgt ccttaatact ttctettatt gtcgttaaca atgatctgga gcattggtct 27600

cgcctttttt tggctgctta gtgcaaaagc cactattttt cacaggtatg tggaagaagg 27660

aactagcacc ccctttacga tacctgaaac aattaaggcg gccigatgaag tttcttggta 27720

caaaggctcg ctctcagacg gcaaccactc attctcagga cagacccttt gcatccaaga 27780

aacttatttt aaatcagaac tacaatacag ctgcataaaa aactttttcc atctctacaa 27840

catctcaaaa ccctatgagg gtatttacaa tgccaaggtt tcagacaact ccagcacacg 27900

gaacttttac tttaatctga cagttattaa agcaatttcc attcctatct gtgagtttag 27960

ctcccagttit ctttctgaaa ccfeac^gttt aattactata aactgcacta aaaatcgcct 28020

tcacaccacc ataatccaca atcacacaca atcaccttgg gttttaaacc taaaatcttc 28080

tccacacatg ccttcgcaat ttctcacgca agttaccgtc tctaacataa gcaagcagtt 28140

tggcttttac tatcctttcc acgaactgtg cgaaataatt gaagccgaat atgaaccaga 28200

ctactttact tacattgcca ttggtgtaat cgttgtttgc ctttgctttg ttattggggg 28260

gtgtgtttat ttgtacattc agag^aaaat attgctctcg ctgtgctcct gcggttacaa 28320

agcagaagaa agaattaaaa tctctacact ttattaatgt tttccagaaa tggcaaaact 28380

aacgctccta ctcttgcttc tcacgccggt gacgcttttt accatcactt tttctgccgc 28440

cgccacactc gaacctcaat gtttgccacc ggttgaagtc tactttgtct acgtgttgct 28500

gtgctgcgtt agcgtttgca gtataacatg ttttaccttt gtttttcttc agtgcattga 28560

ctacttcbgg gtcagactct actaccgcag acacgcgcct cagcatcaaa atcaacaaat 28620 

tgccagacta ctcggtctgc catgattgtc ttgtatttta ccctgatttt ttttcacctt 28680 

acttgcgctt gtgattttca cttccictcaa ttttggaaaa cg'caatgctt .cgacccgcgc 28740 

ctctccaacg actggatgat ggctcttgca attgccacgc ttggggcgtt tggacttttt: 28800 

agtggttttg ctttgcatta caaatttaag actccatgga cacatggctt tctttcagat 28860 

tttccagtta cacctactcc gccgcctccc ccggccatcg acgtgcctca ggttccctca 28920 

ccttctccat ctgtctgcag ctactttcat ctgtaatggc cgacctagaa tttgacggag 28980 

tgcaatctga gcaaagggct atacacttcc aacgccagtc ggaccgcgaa cgcaaaaaca 29040 

gagagctgca aaccatacaa aacacccacc aatgtaaacg cgggatattt tgtattgtaa 29100 

aacaagctaa gctccactac gagcttctat ccggcaacga ccacgagctc caatacgtgg 29160

tcgatcagca gcgtcaaacc tgtgtattct taattggagt ttcccccatt aaagttactc 29220 

aaaccaaggg tgaaaccaag ggaaccataa ggtgctcatg tcacctgtca gaatgccttt 29280 

acactctagt taaaacccta tgtggcttac atgattctat cccctttaat taa.SLtaaact 29340 

tactttaaat ctgcaatcac ttcttcgtcc ttgtttttgt cgccatccag cagcaccacc 29400 

ttcccctctt cccaactttc atagcatatt ttccgaaaag aggcgtactt tcgccacacc 29460 

ttaaagggaa cgtttacctc gctttcaagc tctcccacga ttttcattgc agatatgaaa 29520 

cgcgccaaag tggaagaagg atttaacccc gtttatccct atggatattc tactccgact 29580 

gacgtggctc ctccctttgt a^cctctgac ggtcttcaag aaaacccacc tggggtcttg 29640 

tccctaaaaa tatccaaacc tttaactttt aatgcctcca aggctctaag cctiggctatt: 29700 

ggtccaggat taaaaatitca agatggtaaa ctagtggggg agggacaagc aattcttgca 29760 

aacctgccgc ttcaaatcac caacaacaca atttcactac gtttcgggaa cacacttgcc 29820 

ttgaatgaca ataaCgaact ccaaaccaca ctaaaatctt catcgcccct taaaatcaca 29880 

gaccagactc tgtcccttaa cataggggac agccttgca^ ttaaagatga caaactagaa 29940 

agcgctcttc aagcgaccct cccactctcc attagcaaca acaccatcag cctcaacgtg 30000 

ggcaccggac tcaccataaa tggaaacgtt ttacaagctg ttcccttaaa tgctctaagt 30060 

cccctaacta tttccaacaa taacatcagc ctgcgctatg gcagttccct gacggtgctt 30120 

aacaatgaac tgcaaagcaa cctcacagtt cactcccctt taaaactcaa ctccaacaac 30180
tcaatttctic tcaacactct atctccgttt agaatcgaga atggtttcct cacgctctat 30240

ttgggaacaa aatctggctt gctagttcaa aacagtggct tiaaaagttca agcgggctac 30300

ggcctgcaag taacagacac caatgctctc acattaagat atctcgctcc actgaccatt 30360

ccagactcgg gctcagaaca aggcattctt aaagtaaaca ctggacaggg cctaagtgtg 30420

aaccaagctg gagcgcttga aacatcccta ggaggtggat taaaatatgc tgataacaaa 30480

ataacctttg atacaggaaa cggactgaca ttatctgaaa ataaacttgc agtagctgca 30540

ggtagtggtc taacttttag agatggtgcc ttggtagcca cgggaaccgc atttacgcaa 30600

acactgtgga ctacggctga tccgtctccc aactgcacaa ttatacagga ccgcgacaca 30660

aaatttactit tggcgctnac cattagtggg agccaagtgc tggggacggt ttccattatt 30720

ggagtaaaag gccccctttc aagtagcata ccgccagcta ccgttacagt acaacttaac 30780

tttgattcca acggagccct attgagctcc tcttcactta aaggttactg ggggtatcgc 30840

caaggtccct caattgaccc ttaccccata attaatgcct taaactttat gccaaactca 30900

ccggcttatc ccccgggaca agaaatccaa geaaaatgta acatgtacgt ttctacctti 30960

ttacgaggaa atccacaaag accaatagtt ttaaacatca cttttaataa tcaaaccagc 31020

gggttttcca ttagatttac atggacaaat ttaaccacag gagaagcatt tgcaatgccc 31080

ccatgcactt tttcctacat tgctgaacaa caataaacta tgtaaccctc accgttaacc 31140

cgcctccgcc ctieccatttt aetttacaaa ccacccgatc caccttttca gcagtaaaca 31200

attgcatgtc agtaggggca gt,a^acttt tgggagttaa aatccacaca ggttcttcac 31260

aagctaagcg aaaatcagtt acacttataa aaccatcgct aacatcggac aaagacaagc 31320

atgagtccaa agcttccggt tctggatcag atttttgttc attaacagcg ggagaaacag 31380

cttctggagg attttccaitc tccatctcct tcatcagttc caccatgtcc accgtggtca 31440

tctgggacga gaacgacagt tgtcatacac ctcataagtc accggtcgat gacgaacgta 31500

cagatctcga agaatgtcct gtcgccgcct ttcggcagca ctgggccgaa ggcgaaagcg 31560

cccatgttta acaatggcca gcacogcccg cttcatcagg cgcctagttc ttttagcgca 31620

acagcgcatg cgcagctcgc taagactggc gcaagaaaca cagcacagaa ccaccagatt 31680

gttcatgatc ccataagcgt gctgacacca gcccatacta acaaattgtt tcactattct 31740

agcatgaatg tcatatctga tgtbcaagta aattaaatgg cgccccctta tgtaaacact 31800

tcccacgtac aacacctcct tkggcatctg ataattaacc acctcccgat accaaataca 31860

tctctgatta atagtcgccc cgtacactac ccgattaaac caagttgcca acataatccc 31920

ccctgccata cactgcaaag aacctggacg gctacaatga cagtgcaaag tccacacccc 31980

gttgccatgg ataactgagg aacgccttaa gtcaatagtg gcacaactaa tacaaacatg 32040 

taaatagtgt ttcaacaagt gccactcgta tgaggtgagt atcatgtccc agggaacggg 32100 

ccactccata aacactgcaa aaccaacaca tcctaccatc ccccgcacgg cactcacatc 32160

gtgcatggtg ttcatatcac agticcggaag ctgaggacaa ggaaaagtct cgggagcatt 32220 

ttcatagggc ggtagtgggt actccttgta ggggttcagt cggcaccggt atctcctcac 32280 

cttctgggcc ataacacaca agttgagatc tgatttcaag gtactttctg aatgaaaacc 32340 

aagtgctttc ccaacaatgt atccgatgtc ttcggtcccc gcgtcggtag cgctccttgc 32400 

agtacacacg gaacaaccac ccacgcaggc ccagaagaca gttttccgcg gacggcgaca 32460 

agttaatccc cctcagtctc agagccaata tagtttcttc cacagtagca taggccaaac 32520 

ccaaccagga aacacaagct ggcacgtccc gttcaacggg aggacaagga agcagaggca 32580

gaggcatagg caaagcaaca gaatttttat tccaactggt cacgtagcac ttoaaacacc 32640 

aggtcacgta aatggcagcg atcttgggtt tcctgatgga acataacagc aagatcaaac 32700 

atgagacgat tctcaaggtg ateaaccaca gctggaatta aatcctccac gcgcacattt 32760 

agaaacacca gcaatacaaa agcccggttt tctccgggat ctatcatagc agcacagtca 32820 

tcaattagtc ccaagtaatt ttcccgtttc caatctgtta taatttgcag aataatgccc 32880 

tgtaaatcca agccggccat ggcgaaaagc tcagataatg cactttccac gtgcattcgt 32940 

aaacacaccc tcatcttgtc aatccaaaaa gtcttcttct tgagaaacct gtagtaaatt 33000 

aagaatcgcc aggttaggct cgatgcctac atcccggagc ttcattctca gcatgcactg 33060 

caaatgatcG agcagatcag aacagcaatit agcagccagc tcatccccgg tttccagttc 33120 

cggagttccc acggcaatta tcactcgaaa cgtgggacaa atcgaaataa catgagctcc 33180 

cacgtgagca aaagccgtag ggccagtgca ataatcacag aaccagcgga aaaaagatcg 33240 

cagctcatgt ttcaaaaagc tctgc^gatc aaaattcagc tcatgcaaat aacacagtaa 33300

agnttgcggt atagtaaccg aaaactacac gggtcgacgt tcaaacatct cggcttacct 33360

aaaaaagaag cacattttta aaccacagfcc gcttcctgaa caggaggaaa t^tggtgcgg 33420 

cgtaaaacca gacgcgccac cggatctccg gcagagccct gataatacag ccagctgtgg 33480 

ttaaacagca aaacctttaa ttcggcaacg gttgaggtct ccacataatc agcgcccaca 33540 

aaaatcccat ctcgaacttg ctcgcgtagg gagctaaaat ggccagtata gccccatggc 33600
acccgaacgc taatctgcaa gtatatgaga gccaccccat tcggcgggat cacaaaatca 33660

gtcggagaaa acaacgtata caccccggac tgcaaaagct gttcaggcaa acgcccctgc 33720 

ggtccctctc ggtacaccag caaagcctcg ggtaaagcag ccatgccaag cgcttaccgt 33780 

gccaagagcg actcagacga aaaagtgtac tgaggcgctc agagcagcgg ctatatactc 33840 

tacctgtgac gtcaagaacc ga^agtcaaa agttcacccg gcgcgcccga aaaaacccgc 33900 

gaaaatccac ccaaaaagcc cgcgaaaaac acttccgtat aaaatttccg ggttaccggc 33960 

gcgtcaccgc cgcgcgacac gcccgccccg ccccgcgctc ctccccgaaa cccgqcgcgc 34020 

ccacttccgc gttcccaaga caaaggtcgc gtaactccgc ccacctcatt tgcacgttaa 34080 

ctcggtcgcc atcttgcggt gttatattga tgatg 34115 

<210> 7 

<211> 44 

<212> DNA 

<213> Artificial
<220> 

<223> primer P5Ii 

<400> 7 

gcgcacgcgt ctctatcgat gaattccatt ggtgatggac atgc 44 

<210> 8 

<211> 36

<212> DNA

<213> Artificial 

<220> 

<223> primer PSITR 
<400> 8 

gcgcatttaa atcatcatca ataatatacc tcaaac 36

<210> 9 

<211> 31 

<212> DNA

<213> Artificial

<220> 

<223> primer P5XT0P 

<400> 9 

gatacctagg aacgaggagg atttgatatt g 31

<210> 10

<211> 20

<212> DNA
<213> Artificial

<220>

<223> primer P5XB0T

<400> 10

atgtacgcct ccgcgctcac 20

<210> 11 

<211> 31

<212> DNA 

<213> Artificial 

<220> 

<223> primer P5E4 
<400> 11 

gatcgaattc ccactctgta ccccatctct g 31

<210> 12 

<211> 31967 

<212> DNA 

<213> Simian adenovirus 

<220>

<221> CDS
<222> (13796) . . (15322)

<220> 

<221> CDS 

<222> (18257) . . (21010) 
<220> 

<221> CDS 

<222> (27192) . . (29015) 
<400> 12 

catcatcata atatacctta tttgggaacg gtgccaatat gataatgagg aggcggggtt 60 
aggggtggag tgagggtggg gtgcggatga cgcgggcgcg gggcggggtg ggagtctgac 120 
gtggggcgcg gggtggagcg cgagggtgag ggcggggcga gggcggcggg cgcggcggaa 180
ttgacgtaca cggtagtaag tttgagcgga aattaagtga attgggcgtg ttttttgtaa 240 
ctttttgacg tacacggtag taagtttgag cggaaattaa gtgaattggg cgtgtttttt 300 
gtaacttttt gacgtacacg gtagtaagtt tgagcggaaa ttaagtgaat tgggcgtgtt 360 
ttttgtaact ttttgacgta cacggtagta agtttgagcg gaaattaagt gaattgggcg 420
tgttttttgt aactttttga cgtacacggt agtaagtttg agcggaaatt aagtgaattg 480
ggcgtgtttt ttgtaacttt ttgacgtaca cggtagtaag tttgagcgga aattaagtga 540

attgggcgtg ttttttgtaa ctttttgacg tacacggtag taagtttgag cggaaattaa 600

gtgaattggg cgtgtttctc gtaacttttt ggtcattttg gegcgaaaac tgagtaatga 660

ggaagtgaga cggactctgc ccttttttac ggttgggagg gaaaactgct gatcagcgct 720

gaactttggg ctctgacgcg gtggtttccc tacgtggcag tgccacgaga aggctcaaag 780 

tcctcgtttt attgtgtgct cagccttttc gagggtattt aaacaccgtc age^ccgccaa 840 

gaggccactc ttgagtgcga gcgagtagag ttttctcctc egtcgebgcc gcggctgctc 900 

agtcttaccg ccaggatgcg aatgctgccg gagatcttca ccgggtcctg ggaafiatgct 960 

ttccagggac ttttagaatc tgaagacaac tttccccaac ctictgagcc ggaggagcta 1020 

cctgaggttt cgcttcacga tctgtttgac gtggaggtgg agagecccga egg^gatccg 1080 

aacgaggaag ctgttgatgg tatgttcccc gactggatga tatctcagag cgagagtgct 1140 

gaaggcagtg cggactcggg cgtttctggg gttggaaacc tggtggaggt ggatctggac 1200

ttgaagtgtt aegaggaagg ttttcotcct agcgactcag agactgatga agcctcagaa 1260

Scggaaggtc aagaggagtc tgtgtgtggt tatgtgaaga ttaatgaggg ggagaacctg 1320 

ctggtgttgg actgtccgga ccaacctgga catggctgtc gagcctgtga ctttcaccgg 1380 

gggaccaslcg gaaacccgga algctatctgt gctttgtgct acatgcgtct gaacgagcac 1440 

tgcatataca gtgagtgtta ttcatgggtt atttatgggg aaagttgggg gaaagtcctg 1500

agaaggggaa aagtttaaca tgtcattttt gtacttgata ggtccagttt dggacgctga 1560 

gggggattct gagtcccctg ctggtccttc ccagccctca ccctgctctc tgaqcgccac 1620 

gcccgcacct gacctagtta gaccaacgcc ctgccgagtg tcctgtagac gacg&gcagc 1680 

tgttaattgc atagaagatt tattggccco tgatgacgag aacgcacctt tgaacctgtg 1740 

cctgaaacgc cctaagacat cttga^tgtt tatgctgtta ataaaagcgt tgacccttag 1800 

atcctgtgtt tattccttgg gcgtgtgcgc gggtatataa agcagctgcg ggctggagtg 1860

ttagtttatt ctgatggagt actggagtga gctgcagaat taccagagcc tccggcgcct 1920

gctggagttg gcctctgcca gaacatccac ctgctggagg ttctgttttg gctcgactct 1980 

cagtaacgtg gtgtatoggg tg^atrcaaga gtacagctcg cgcttttctg agctgttggc 2040 

ccgctacccg gctgtttttg tttctctgga tctaggccat cacgtttatt tccakgaagc 2100 

tgtagtcaga tatttggatt tttctactcc cgggcgtgcg gtttctgcga ttgccttcat 2160 

ctgctttgtg ctagatcgat ggagcgcoca aacccgcctg agcccggggt acaccotgga 2220 
ctacctgacc atgtccctgt ggagggccat gctgcggaag aggagggtct caggcttctc 2280

gccggcgcgg cctccgcacg gaotggatcc ggtgotggag gagtcggagc tggaggagga 2340 

ggagaacccg agggccggoc tggaccctcc ggcggaatag tgacggaace ggaggatccc 2400 

caagagggta ctagtcaggg gggagggggg ccgaagagaa agigggatga agaggaggcg 2460 

atggaccccg acaggtttct aaaagaactg actttaagct taatgtctaa gagaagaccc 2520 

gagacggtgt ggtggtctga tttggagaag gagttccacc agggggagat gaatctgttg 2580

tacaagtatg ggtttgagca ggcgaagaet cactggetgg aagcctggga ggaetgggag 2640

atggctttta acatgtttgc caaggtggcg ctgcgcccgg acaetattta caccgtgact 2700 

aagacggtgg aaatcogcaa gcctgtgtat gtgattggca acggggccgt ggttcggttc 2760 

cagaceaccg accgggtggc cttUattgc tgtatgcaga acctgggccc gggggtgatt 2820 

aatcttaatg gagtgacctt ttgcaatgtc agattcgcgg gggatggatt caacgggacg 2880 

gtgttcgecg ccaccaccca gataacccta cacggggtgt tcttccagca tgtaggcggg 2940 

gcttgcgtag atacctgggc gagggcctcc gtgaggggct gcacctttgt gggctgttgg 3000 

aaagcggtgg tgggtcgacc ciagagtgtg ctgtctgtga agaaatgtgt gtttgagaga 3060

tgtctgatgg ccatggtggt ggagggccag ggtaggatcc gccataacgc gggbtccgag 3120 

aatacctgtt ttgccctgct gaagggtacg gcgaccgtga agcataaoat gatctgcggg 3180

gtgggtcact ogcagctgct gacct^tgcg gatggcaact gccaggccct gcgcacggtg 3240 

catgtggtgt cooaccggcg ccgcccctgg ccggtgcttg aacataacat gctgatgcgc 3300 

tgtaccatgc acctgggcta ccgccgcggc gtgtttgtgc cccatcagtg taacctgacc 3360 

cacaccaagg tgttgctgga gacggatgct ttttcgcgag tgaatctgaa tggggtgttc 3420 

gatctgacta tggagatgta caigatagtg agatttgatg aatcaaagac cegttgtcgc 3480 

ccctgcgagt goggcgccaa tcacctgagg atgtatcccg tgaecctgaa cgtgacggag 3540 

gagctgcgcc cggaccacca gatgctgtcc tgtctgcgca ccgattaega aagcagcgat 3600 

gaggattaag aggtgagggg cggggcttgc atggggtata aaggcggggg aggaggtggg 3660 

9asggggaaa ecccaaaatg agcggatcga tggaagggag cgctgtgagt tttgagggcg 3720 

gggtgttcag cccatatctg acaacccgtc tccocgcctg ggcaggagtg cgtcagaatg 3780 

tggtgggctc caacgtggac ggac&tccgg tggcccctgc caactccgco actctcacct 3840 

acgccaccgt cggatcgtcg ctggacaccg ccgctgccgc cgccgcttca gccgccgctt 3900 
ctactgctcg cggtatggca gctgatttcg gactgtatca gcaactggct gcgcctcgct 3960

cgtcgctgag agaagatgat gccctgtccg tggtgctgac ccgcctggag gagctgtccc 4020 

agcagctgca agagctgtct gccaaagtgg atgcacagaa cgtccccgct acccaatgaa 4080

taaataaacg agacaccgag tgtgtttgga aatcaaaatg tgtttttatt tgttttttct 4140 

ggcgcggtag gcccttgacc acctgtcgcg gtcgttaagg accttgtgga tgttttccag 4200 

cacccggtag aggtgggctt ggatgttgag gtacatgggc atgagcccgt ctcgggggtg 4260 

gaggtagcac cactggaggg cgtcgtgctc gggggtggtg ttgtagataa tccagtcgta 4320 

gcagggtttt tgggcatgga agcggaagat gtctttgaga agcaggctga tggccagggg 4380 

gaggcccttg gtgtaggtgt tcacaaagcg gttgagctgg gagggatgca tgcgggggga 4440 

gatgagatgc atcttggcct gaatcttgag gttggcgatg ttgccgccca gatcccgccg 4500 

ggggctcatg ttgtgcagga ccaccaggac ggtgtagccg gtgcacttgg ggaatttgtc 4560 

atgcaacttg gaagggaagg cgtggaagaa cttggagacc cccttgtggc cgccgaggtt 4620 

ctccatgcat tcgtccatga tgatggcgat gggacccctg gcggccgccc tggcgaagac 4680

gttgtcgggg tgggagacgt cgtagttctg ttccagggtg agctcgtcgt aggcbatttt 4740

gacgaagcgg gggagcaggg tgctcgactg ggggacgatg gtaccttcgg gacccggggc 4800

gtagttgccc tcgcagattt gcatctccca ggccttgatc tccgaggggg ggatcatgtc 4860

cacctggggc gcgatgaaga agacggtctc cggggcgggg ttgatgagct gggaggagag 4920 

gaggttgcgg agcagctgcg acttgccgca cccggtgggc ccgtagatga ccccgatgac 4980

gggttgcagc tggtagttta aggagctgca gctgccgtcc tcgcgcagga acggggcgac 5040

ctcgttcatc atgcttctga cgtgatggtt ttccctgacg aggtcttgca agagccgctc 5100 

gccgcccagg gagagaagct cttccaggct gcggaaatgc ttgaggggtt tgaggccgtc 5160 

ggccatggtc atctttticca gggactggcg gagcaggtac aggcggtccc agagctcggt 5220 

gacgtgttct acggcatctic gacccagcag acttcttggt tgcgggggtt ggggcggctt 5280

tggctgtagg ggaccagccg gtgcgcgtcc agggaggcga gggtgacgtc tttccagggc 5340 

cgcagcgttc gcgtgagggt ggtctcggtg acggtgaagg gacgcgctcc cggtcgggcg 5400 

ctggccaggg tcctcttgag actcatcctg ctggtgtgga agcgggcgtc ttctccctgg 5460 

gagtcggcca ggtagcattt gagqatgagg tcgtagctga gggcctcggc cgcgtggccc 5520 

ttggcgcgca gcttgccttt ggagacgtgt ccgcaggcgg gacagtgcag gcacttgagg 5580 

gcgtagagct tgggggccag gaagacggac tcgggggagt aggcgtcggc gccgcactga 5640 
gcgcacgtgg tctcgcactc gacgagccag gtgagctccg ggtgttgggg a&caaaaacc 5700 

agctggcccc cgtgtttttt gatgcgcttc ttacctcggg tctccatgag gcg^cgtccg 5760 

gcttcggtga cgaagaggct gtcggtgtcg ccgtagacgg atttgagcgc gcgctgctcc 5820 

a9999adtcc cgcgatcctc cgcgtgcagg aactcggacc actctgagac gaaggcccgg 5880 

gtccacgcga ggacaaagga ggcgatctgg gacgggtagc ggtcgttctc caccagggga 5940 

tccaccttct ccagggtgtg caggcagagg tcgtcctcct ccgcgtccat gaaggtgatt 6000 

ggcttgtaag tgtatgtcac gtgaccgtcg gggtcgcgcg tgggcttata aaagggggcg 6060 

tgcccggcct ccccgtcact ttcttccgca tcgctgtgga cgagatccag ctgctcgggt 6120 

gagtaggcgc gctggaaggc gggcatgacc tcggcgctga gggtgtcagt ttccacgaac 6180 

gaggtggatt tgatattgac ctgtccggcg gcgatgcttt tgacggtggc ggggtccatc 6240 

tggtcagaaa agacgatctt tttgttgtcc agcctggtgg cgaacgaccc gtagagggcg 6300 

ttggagagca gcttggcgat ggagcgcagg gtctggttct tctcgcggtc ggcgcgctcc 6360 

ttggcggcga tgttgagctg gacgtactcg cgggccacgc agcgccattc ggggaagacg 6420 

gtggcgcgct cgtccggcag gaggcgcacg cgccagccgc ggttgtgcag ggtgatgagg 6480 

tccacgctgg tggccacctc gccgcgcagt ggctcgttgg tccagcagag gcgcccgccc 6540 

ttgcgcgagc agaagggggg caggacgtcg agctggtcct ccgcgggggg gtcggcgtcg 6600 

atggtgaaga tgcccggtag caggtggcgg tcgaagtagt cgatggcgac cgcggggtcg 6660 

gcgagggcgc gttcccagtc cctgaccgcc agggcgcgct cgtaggggtt gaggggcgcc 6720 

ccccagggca tgggatgggt gagggccgag gcgtacatgc cgcagatgtc gtagacgtag 6780 

aggggctcgc ggagcacgcc gaggtcggtg ggatagcagc gtccgccgcg gatgctggcg 6840 

cgcacgtagt cgtacatctc gtgcgagggg gcgaggaggc cgcctccgag gtcgccgcgc 6900 

tgcggtctga cggcccggta ggtgacctgg cggaagatgg cgtgcgagtt ggaggagatg 6960 

gtgggccgct ggaagatgtt gaagctggcc tcggggagtc cgacggcgtc gtggacgaac 7020 

tgggcgtagg agtcgcgcag cttctgcacg agcgcggcgg tgacgagcac gtccagggcg 7080 

cagtagtcga gggtctcgcg gacgaggtcg taacggggct cttgcttctt ttcccagagt 7140 

tcgcggttga ggaggtactc ctcgcgatcc ttccagtact cttcggccgg aaagccgcgt 7200 

tcgtccgcca ggtaagaacc cagcatgtag aagcggttga cggctcggta gggacagcag 7260 

cccttctcga cgggcaggga gtaggcctgc gcggccttcc tgagcgaggt gtgggtgagg 7320 
Scgaaggtgt cgcgcaccat gaccttgagg aacbggaacc tgaagtcggt gtcgtcgcag 7380 

gcgccccgct cccagagccc gtagtcggtg cgtttctggc tgcgggggtt gggcagggcg 7440 

aaggtgacgt cgttgaagag gatcttgccg gcgcgcggca tgaagttgcg ggtgatcctg 7500 

aagggccccg gcacgtccga gcggttgtta atgacctggg ccgcgaggac gatctcgtcg 7560 

aagccgttga tgttgtggcc gacgatgtag agctcgacga agcgcgggcg cccctgcagc 7620 

ttgggggcct tcttgagctc ctcgtaggtg aggcagtcgg gcgagtagag gcccagctcc 7680 

tgtcgggccc attcggccac ctgggggttg gcttgcaaga agccccgcca gagctgcagg 7740 

gcgagctggg tctggaggcg gtcgcggtag tcgcggaact ttttgcccac cgccatcttc 7800 

tcgggggtga ccacgtagaa ggtgcggccg tcctggcccc aggcgtccca gttctgctcg 7860 

cgggcgagac ggcaggcctc ctcgacgagg gcctcctccc cggagagatg catgactagc 7920 

atgaagggga cgagttgctt gccgaaggca cccatccacg tgtaggtctc tacgtcgtag 7980 

gtgacgaaga gacgttcggt gcgaggatgc gagccgagag gaaagaagtt gatctcctgc 8040 

caccagccgg aggagtgggc gttgacgtgg tggaagtaga agtcacgccg gcggaccgtg 8100 

cattcgtgct gatatttgta aaagcgggcg cagtactcgc agcgctgcac gctctgcact 8160 

tcctgaacga gatgcacccg gcgcccgcgc accaggaggc ggagggggca gtccagtgga 8220 

gcttcggcgc gctgtccttc agcctcgtca tgctcttctg cacctgcacg ctcctgctgt 8280 

gggtggagga cggagggagt gadgacgccg cgcgagccgc aggtccagat gtcgkcgcgc 8340 

ggcggcctga ggcfccagcgc cagggtgcgg atctgagcgg cgtccaggga gtcgaggaag 8400 

gcctcgctga ggtcgacggg cagcgtccgc cggtggactt gcaggagacg ggtaagggcc 8460 

ggcgccaggc gctgatggta cttgatctcg agcggttcgt tggtggaggt gtcgatggcg 8520 

tagagcaggg cctgaccgcg ggcggpgacg atggtgccgc ggtgccggcg gtagg^iggcg 8580 

tattcggggg ggctcgttac atcacccgcc tgggcctggc gccgggcggc agcgggggtt 8640 

ctggtcccgc cggcatgggc ggcagcggca cgtcggcgcg gggctccggc agcggctggt 8700 

gctgagctcg cagctgactg gcgtgcgcga cgacgcggcg gttgaggtcc tggatgtgcc 8760 

tccgctgcgt gaagaccacc ggtccccgga ctcggaacct gaaagagagt tqgacagaat 8820 

caatctcggc atcgttgacg gccgcdtgac gcaggatctc ctgcacgtcg cccgagttgt 8880 

cctggtaggc gatctcggac atgaactggt cgatctcttc ctcctggagt tcgccgcgtc 8940 

cggcgcgttc gacggtggcc gcgaggtcgt tggagatgcg agccatgagc tgggagaagg 9000 

cgtcgaggcc gttctcgttc cacacgcgac tgtagacgac gttgccgacg gcgtcccggg 9060 
cgcgcatgac cacctgcgcg acgttgagct ccacgtgtcg cgcgaagacg gcgtagttgc 9120 

gcaggcgctg gaagaggtag ttgagggtgg tggcgatgtg ctcgcagacg aagaagtaca 9180 

tgacccagcg gcgcagcgtc atctcgttga tgtctccgag ggcttccaag cgctccatgg 9240 

cctcgtagaa gtcgacggcg aagttgaaga actgggagtt gcgcgccgcg accgtcagct 9300 

cgtcttgcaa gagccggatc agctgggcca cggtctcccg cacctcgcgt tcgaaggccc 9360 

ccggcgcttc ttcctcctct ggttcctcgg cggcctcttc ttccatgacg gcttcctctt 9420 

cctccggttc ctcgggcacg ggcctccggc ggcgacggcg cctgatgggc aggcggtcca 9480 

cgaagcgttc gatgatctct ccgcggcggc ggcgcatggt ttcggtgacg gcgcggccgt 9540 

tctctcgggg ccgcagttcg aagacgcccc cgcgcaggcc gccggcgccg ccgagagggg 9600 

gcaggaggtg ggggccttcg ggcagcgaga gggcgctgac gatgcaccgt atcatctgtt 9660 

gcgtaggtac agctctccag gagtcgttga gcgagtccag ttggacggga tccgagaact 9720 

tttcgaggaa agcttcgatc caatcgcagt cgcaaggtaa gctgaggacg gtgggatgag 9780 

gggcttggcg ggaggcggag gcggcagaag aggaggagga gggcaggctg gsiggtgatgc 9840 

tgctgatgat gtaattgaag taggcggttt tcaaacggcg gatggtggcg aggaggacga 9900 

cgtctttggg cccggcctgc tggatgcgca ggcggtcggc catgccccag gcgtggctct 9960 

ggcatcggcg caggtccttg tagtagtctt gcatgagtct ctcgacgggg acgtcgtctt 10020 

cgtcggcccg gtcggccatg cgggtggagc cgaacccgcg caggggctgc agcagggcca 10080 

ggtcggcgac cacgcgttcg gccagcacgg cctgctggat ctgggtgagc gtggtctgga 10140 

agtcgtccag gtccacgaag cggtggtagg agcccgtgtt gatggtgtag gtgcagttgg 10200 

ccatgacgga ccagttgacg acttgcatgc cgggctgggt gatctcggtg tagcggaggc 10260 

gcgagtaggc ccgcgactcg aagacgtagt cgttgcaggt gcgcacgagg tactggtagc 10320 

cgacgaggaa gtgcggcggc ggctcgcggt agaggggcca gcgcacggtg gcgggggcgc 10380 

cgggggccag gtcctccagc atgaggcggt ggtagtggta gacgtagcgc gagagccagg 10440 

tgatgccggc ggc^gaggtg gcggcgcggg cgaagtcgcg gacgcggttc cagatgttgc 10500 

gcaagggggc gaagcgctcc atggtgggca cgctctggcc ggtgaggcgg gcgcagtcct 10560 

gcacgctcta gacgggacag agagcgggag gttagcggct ccgctccgtg gcctggggga 10620 

cagaccgcca gggtgcgacg gcggggaacc ccggttcgag accggctgga tccgtccgtc 10680 

cccgacgcgc cggccccgcg tccacgaccc caccagaggc cgagacccag ccgcggtgcc 10740 
cggaccccag atacggaggg gagccttttt gtggtttttt cccgtagatg catccggtgt 10800 
tgcgacagat gcgtccgtcg ccagcgccgc cgacgcagcc gccgctcccg ccccccacta 10860 
gcgccgcgga ggctctgtcc ggcggccgcg gcgacccgga ggaggaggcc atcctcgact 10920 
tggaagaagg cgagggcctg gcccggctgg gagcgccctc ccccgagcgc cctccccgcg 10980 
tgcagctggc gagagactcg cgccaggcct acgtgccgcc gcagaatctg ttcagggacc 11040 
gcagcggcca ggagcccgag gagatgaggg accgcaggtt tcacgcgggg cgggagctgc 11100 
gcgcgggctt cgaccgtcgg cgggtgttgc gcgccgaaga cttcgagccc gacgagcgca 11160 
gcggagtaag tccggcacgg gcgcacgtgt cggcggccaa cctggtgacc gcgtacgagc 11220 
agacggtgaa cgaggagcgg agctbtcaga aaagcttcaa caaccacgtg cgcaccctga 11280 
tcgcgcgcga ggaggtggcc atcggcctga tgcatctgtg ggactttgtg gaggcgtacg 11340 
tgcagaaccc gtcgagcaag ccgctgacgg cgcagttgtt cctgatcgtg cagcacagtc 11400 
gggacaacga gacgttccgc gaggcgatgc tgaacatcgc ggagcccgag ggccgctggc 11460 
tcttggacct gattaacatc ctgcagagca tcgtggtgca ggagcgcagc ctgagcctgg 11520 
ccgacaaggt ggcggcca&c aactacagca tgttgagcct gggcaagttt tacgcccgca 11580 
agatctacaa gagcccctac gtgcccatag acaaggaggt gaagatcgac agcttttaca 11640 
tgcggatggc gctgaaagtg ctgacgctga gcgacgatct gggggtgtac cgcaacgacc 11700 
gcatccacaa ggccgtgagc gccagGcgcc ggcgcgagct gagcgaccgc gagctgatgc 11760 
acagcctgcg gagggcgctg gcgggcgccg gcggcggcga ggaggccgag tcctacttcg 11820 
acatgggggc ggacttgcag tggcagccca gcgcgcgggc cctggaggcg gcgggctacc 11880 
gcggcggcgg cggcgtggtc gaggcggagg acgaggacga ggtggagtac gaggaggagg 11940 
actgatcggc gaggtgtttt cgtagatgca gcgcgcgacg gcggcggcga gcgggccgca 12000 
gggggacccc gccgtgctgg cggccctgca gagccaacct tcgggcgtga acgcctccga 12060 
tgactgggcg gcggccatgg accgcs^tttt ggccttgacc acccgcaacc ccgaggcctt 12120 
tagacagcag ccgcaggcca accgcttttc ggccatcttg gaagccgtgg tgccctcgcg 12180 
caccaacccc acgcacgaga aggtcctggc ggtggtgaac gcgctgctgg agagcaaggc 12240 
gatccgcaag gacgaggcgg ggctgattta caacgccctg ctggagcggg tggcgcgcta 12300 
caacagcacc aacgtgcagg ccaacctgga ccgtctgacg acggacgtgc gggaggcggt 12360 
ggcgcagcgg gagcgcttca tgcgcgacac gaacctgggc tcgcaggtgg ccctgaacgc 12420 
cttcctgagc acgcagccgg ccaacgtgcc gcgcgggcag gaggactacg tcagtttcat 12480 
cagcgcgctg cgcctcctgg tggccgaggt gccgcagagc gaggtgtacc agtcgggtcc 12540 

ggactacctc ttccagacct cgcggcaggg cctgcagacg gtgaacctga cgcaggcctt 12600 

caagaacctg gaaggcatgt ggggcgtgcg ggcccccgtg ggcgaccggg cgacgatctc 12660 

cagcttgctg acgccgaaca cgcggctgct gctgctgctg atcgcgccct tcaccaatag 12720 

cagtaccatc agccgcgact cgtacctggg ccacctgatc acgcfcgtacc gcgaggccat 12780 

cgggcaggcg caggtggacg agcagacctt ccaggagatt acgagcgtga gccgggccct 12840 

ggggcagcag gacacgggta gcptggaggc gacgctgaat tttctgctga ccaaccggcg 12900 

gcagaagatc ccctcccagt acacgctgag cacggaggag gagcgcatct tgcgctacgt 12960 

gcagcagtcc gtgagcctgt atctgatgcg cgagggggcg agcccctcgt cggcgctgga 13020 

catgacggcc cgtaacatgg agccgtcgct gtacgcggcc caccggccgt tcgtgaaccg 13080 

cctgatggac tacctgcacc gcgccgccs[c catgaacggc gagtacttta cgaacgccat 13140 

cctgaacccg caccggatgc cgccgtccgg cttctacacg ggggactttg acatgcccga 13200 

gggcgacgac gggttcctgt gggacgacgt gtcggacagc gtgttcgcgc cggtgcgtcc 13260 

gggcaagaag gagggcggcg acgagctgcc gctgtccgtg gtggaggcgg cgtcgcgcgg 13320 

ccagagcccg tcccccagcc tccc^tcgtt gtcggcgagc agcagcagcg gccgggtctc 13380 

gcgcccgcgg ctggagggcg actacctgaa cgacccgctg ctgcgccccg cccggcccaa 13440 

gaactttccc aacaacgggg tggagagcct agtggataag atgaatcgct ggaagaccta 13500 

cgcccagcag cagcgggagt gggaggagag tcagccccgc cccctgcctc cgpcgcgctc 13560 

caggtggcgc cggcgggaag aagacccgga agactcggcg gacgatagca gcgtgttgga 13620 

cttggggggg accggtgccg cctcgacaaa cccgttcgcc cacctgcgcc cgcagggccg 13680 

gctgggtcgg ctgtattgag gaaagaaact aataaaagaa aaaagagctt gcttaccaga 13740 

gccatggtcg cagcgtcggt ccctttgtgt gtgttttctc ctccccggta gcg.aa atg 13798 

Met 
1 

agg cgc gcg gtg gga gtg ccg ccg gtg atg gcg tac gcc gag ggt cct 13846 
Arg Arg Ala Val Gly Val Pro Pro Val Met Ala Tyr Ala Glu Gly Pro 
5 10 15 

cct cct tct tac gaa acg gtg atg ggc gcc gcg gat tcg ccg gcc acg 13894 
Pro Pro Ser Tyr Glu Thr Val Met Gly Ala Ala Asp Ser Pro Ala Thr 
20 25 30 

ctg gag gcg ctc tac gtc cct ccc cgc tac ctg ggg cct acg gag ggg 13942 
Leu Glu Ala Leu Tyr Val Pro Pro Arg Tyr Leu Gly Pro Thr Glu Gly 
35 40 45 



agg aac agc atc cgt tac tca gag ctg gcg ccg ctg tac gac acc acc 
Arg Asn Ser Ile Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr Thr 
50 55 60 65 

cgc gtg tac ctg gtg gat aac aag tcg gcg gac atc acq tca ctd aac 
Arg Val Tyr Leu Val Asp Asn Lys Ser Ala Asp Ile Thr Ser Leu Asn 
70 75 80 

tac cag aac gac cat agc aac ttt ctg acc acg gtg gta caa aac aat 
Tyr Gln Asn Asp His Ser Asn Phe Leu Thr Thr Val Val Gln Asn Asn 
85 90 95 

9C3«gc acg cag acc ata aat ttc gac gag 
Asp Phe Thr Pro Val Glu Ala Gly Thr Gln Thr Ile Asn Phe Asp Glu 
100 105 110 

cgc tcg cgg tgg ggc ggc gai ctg aaa acc atc ctg cgc acc aac atg 
Arg Ser Arg Trp Gly Gly Asp Leu Lys Thr Ile Leu Arg Thr Asn Met 
115 120 125 

ccc aac atc aac gag ttc atg tcc acc aac aag ttc agg gcc cgg ttg 
Pro Asn Ile Asn Glu Phe Met Ser Thr Asn Lys Phe Arg Ala Arg Leu 
135 140 145 

atg gta gag aaa gtg aac aag gaa acc aat gcc cct cga tac gag tgg 
Met Val Glu Lys Val Asn Lys Glu Thr Asn Ala Pro Arg Tyr Glu Trp 
150 155 

It^ flf ° ^^'^ ggc aac tac teg gag ace atg acc ata 

Phe Glu Phe Thr Leu Pro Glu Gly Asn Tyr Ser Glu Thr Met Thr Ile 
165 170 175 

gac ctg atg aat aac gcg atc gtg gac aac tac ttg gaa gtg ggg cgg 
Asp Leu Met Asn Asn Ala Ile Val Asp Asn Tyr Leu Glu Val Gly Arg 
180 185 190 

cag aac ggg gtg ctg gag agc gac atc ggg gtg aag ttt gac acg cgc 
Gln Asn Gly Val Leu Glu Ser Asp Ile Gly Val Lys Phe Asp Thr Arg 
195 200 205 



X 9^^" gtc acc aag ctg gtc atg ccc ggc 
Asn Phe Arg Leu Gly Trp Asp Pro Val Thr Lys Leu Val Met Pro Gly 
220 225 



«»? 1^*^ flf*' 3*=*= -^^^ sac ate gtc ctg ctg ccc ggc 

Val Tyr Thr Asn Glu Ala Phe His Pro Asp Ile Val Leu Leu Pro Gly 
230 235 240 

tgc ggc gtg gac ttc acg cag agc cgg ctg agc aac ctg cta qm' atc 
Cys Gly Val Asp Phe Thr Gln Ser Arg Leu Ser Asn Leu Leu Gly Ile 
245 250 255 

t^^ ^'"^ ^^"^ cag gcg ggt ttt cag ate atg tac gag gac 
Arg Lys Arg Met Pro Phe Gln Ala Gly Phe Gln Ile Met Tyr Glu Asp 
260 265 270 
ctg gag ggc ggc aac atc ccc gcb ttg cta gac gtg aea aaa tai? -i^ie^-. 
Leu Glu Gly Gly Asn Ile Pro Ala Leu Leu Asp Val ?S IS ^ 
275 280 285 



390 395 



4.00° 



V-? f f"^ ^ ^''3 aag agc ttt tac aac gag cag gcc 

Val Glu Leu Leu Pro Val His Ala Lys Ser Phe Tyr Asn Glu Gln Ala 
405 415 



14710 



9CC age att cag aag gcg egg gag cag ggc cag gag ate cgc qqc aac 
Ala ser He Gin Ly6 Ala Arg Glu Gin lly Gin 111 lie S^ Ify Isp 
290 295 300 . 305 

»™ ov*" tZ'^ f^^ ^^"^ 9**= S'^S 9*3 3t9 ccc gtg gW aag 14758 

Asn Phe Thr Val Ile Pro Arg Asp Val Glu Ile Val Pro Val Glu Lys 

310 315 320 

gat agc aag gac cgc agt tac aac cta ctc ccc ggc gac cag acc aac 14806 
Asp Ser Lys Asp Arg Ser Tyr Asn Leu Leu Pro Gly Asp Gln Thr Asn 
325 330 335 

acg gcc tac cgc agc tgg ttc ctg gcc tac aac tac ggc gac ccc aac 14854 
Thr Ala Tyr Arg Ser Trp Phe Leu Ala Tyr Asn Tyr lly Pro III 

'345 350 

ff,! **=3 =•'3 '^'^3 acc acg gac gte acc tgc 14902 

Lys Gly Val Arg Ser Trp Jbk Leu Leu Thr Thr Thr Lp Val tS Cys 
355 360 365 

ggc tcg cag cag gtg tac tgg tcg ctc ccg gac atg atg caa gac ccc 14950 
Gly Ser Gln Gln Val Tyr Trp Ser Leu Pro Asp Met Met Gln Asp Pro 
370 375 380 385 

v!? nK*' *3C caa gte age aac tac ccc gtg gtg gga 14998 

Val Thr Phe Arg Pro Ser Ser Gln Val Ser Asn Tyr Pro Val Val Gly 



15046 



15094 



S'? *^*^3 cag etc ate cgc cag tec acc gcg etc acg"cac <itc tte 
Val Tyr Ser Gln Leu Ile Arg Gln Ser Thr Ala Leu Thr His Val Phe 

425 430 

tt° gag aac cag atc ctg gtg cgc ccg ccc gct ccg acc 15142 

Asn Arg Phe Pro Glu Asn Gln Ile Leu Val Arg Pro Pro Ala Pro Thr 

440 445 

tn^ ^^f gaa aac gtt ccc gcc ctc aca gat cac gga acc 15190 

Ile Thr Thr Val Ser Glu Asn Val Pro Ala Leu Thr Asp His Gly Thr 

455 460 

ctg ccg ctg cgc agc agt atc agt gga gtc cag cgc gtc acc atc act 
Leu Pro Leu Arg Ser Ser Ile Ser Gly Val Gln Arg Val Thr Ile Thr 
470 475 480 

gac gcc cgg cga agg acc tgc ccc tac gtg cac aag gcc ctQ dac ata 15286 
Asp Ala Arg Arg Arg Thr Cys Pro Tyr Val His Lys Ala Leu Gly Ile 
485 490 495 
Ill f Z"" f ^^"^ ^^'^ «9« ttt taa caagcatgtc ' 15332 

Val Ala Pro hya Val Leu Ser Ser Arg Thr Phe acacgcc . 15332 

500 505 

cattctcatc tcgcccgaca acaacapcgg ctggggcctg cgctcggccg gcatgtacgg 15392 

cggcgccaag cggcgctcca gcgagcaccc cgtccgcgtc cgpSgccact accgggcccc 15452 

ctggggcgcc cacaagcgcg gcgtctccac gcgcaccacc gtcgacgacg ccatcgacgc 15512 

cgtcgtggcc caggccagac gctact^ccg gcccaagtcg acggtggacg ccgtcatcga 15572 

cagcgtggtg gccgacgcgc ggcgatacgc tcgacgcaag cggcgtctgc accgccgtcg 15632 

ccgtcccacc gccgccatgc tggqcgccag agcggtcctg agacgcgcgc gccgcgtggg 15692 

ccgccgagcc atgcgccgag ccgcggccaa cgccagcgcg ggtcgcgccc gtcgtcaggc 15752 

cgcccggcag gccgccgccg cc^tcgccaa cctggcccaa ccccgccggg gaaacgtgta 15812 

ccgggtgcga gacgcgtcgg gcgtgcgcgt gccggtgcgc acccgccccc ctcggagtta 15872 

gaagacaaaa agacggacga agactgagtt tccctgtcgt tgccagcatg agcaagcgca 15932 

agttcaaaga agagctgctg gaggccctcg tgcccgagat ctacggcccg gccgccgctg 15992 

ccgccgcggt ggcggacgtc aagcccgaag ttaagccccg cgcgctgaag cgggttaaaa 16052 

agcgggaaaa gaaagaggag aagcaggaag cagggttgct agacgtcgac g?cggcgtgg 16112 

agttcgtgcg gtccttcgcg ccccgtcgcc gggtgcagtg gcggg&tcgc cgcgtcaagc 16172 

tcgtcccgcg gccgggcacc gtggtgtctt tcacccccgg cctgcgttcg gccacgcgcg 16232 

gcctgaagcg cgagtacgac gaggtctatg gcgacgaaga catcctggag caggccgccc 16292 

agcagctcgg ggagtttgct tacggcaagc gcggccgcta cggggaggtg gcgdtggcgc 16352 

tggaccaggg caatcccacg cccagcctca agcccgtcac gctgcagcag gtgctgcccg 16412 

tgagcgcgtc gaccgagagc aagcggggca tcaagaggga gatgggcgac ccgcagccca 16472 

ccatgcaact catggtgccc aaacggcaga agctggagga cgtgctggag aacatgaaag 16532 

tggatcccag catcgagccc gaagtgaaag tgcgacccat caaggaagtg ggcccgggcc 16592 

taggcgtgca gacggtggac attcagatcc ccgtgcgcgc ctcccccgtt tbtgccacca 16652 

ctacgacggc cgtggaggcc atggaaacgc agacggagct gcccgcggcc ttggcggcag 16712 

ccgccaccgc cgccgcggct acccgagaga tgggcatgca gaccgacccc tggtacgagt 16772 

tcgccggccc cgcccgtcgt ccicgagccc gtcggtacgc ggcgaccacc tcccggctcc 16832 

ctgactacgt cttgcatcct tccatcacgc cgacgcccgg ctaccgcgga acgaccttcc 16892 

gccccggtcg cgcgcgcacc accacccgcc gtcgtcgcac cacccgccgc cgtcgcagcc 16952 
Stcgcgcact ggctcccatc gcggttcgcc gcgtcgtccg ccggggtcgc acgctgaccc 17012 

tgcccaccgc gcgttaccac ccc^gcatcg tcatttaacc tgcgctgccg ttttgcagat 17072 

ggctctgacg tgccgctttc gctticcccgt tcggcactac cgaggaagat ctcgccgtag 17132 

gactggtcta gcgggcagcg gtctdcgacg ccgccgccgc gcggtgcacc ggcgcatgaa 17192 

gggcggcatt ctgcccgcgc tgatccccat tatcgccgcc gccatcgggg cgatccccgg 17252 

cgtggcctcg gtggccttgc aagcagctcg caaaaattaa ataaagaagg cttgacactc 17312 

actgcctggt cctgactgtt tcatgcagac aagacatgga agacatcaat tttgcgtcgt 17372 

tggccccgcg gcacggctcg cggccgttca tgggcacctg gaacgagatc ggcaccagcc 17432 

agctcaacgg gggcgctttc agttggagca gcctgtggag cggcattiaaa aacC&tgggt 17492 

ccacgattaa gacctatggc aacasiggcgt ggaacagtag cactggtcag atgctccgcg 17552 

ataagctgaa ggaccagaac tcccagcaga aagtggtaga cggtctggcc tcgggcatca 17612 

acggggtggt ggacctggcc aaccaggcgg tgcagaacca gatcaaccag cgtctggaga 17672 

acagccgcca gccgcccgcg gccctgcagc agcgtccgca ggtggaggag gtggaagtgg 17732 

aggagaagct gccgcccctg gagacggtgt cgccggtggg cgtgcccagc aagggggaga 17792 

agcggccgcg gcccgagctc gaggagaccc tagtgaccga gaccctggag ccgccctcgt 17852 

acgagcaggc cttgaaagag ggggqcacgc ccctgcccat gacccggccc atcggaccca 17912 

tggcccgacc ggtctacggc aag^aacaca aagccgtgac gctagagctg cctccgccgg 17972 

cgcccaccgt acccccgatg cccggtccca ccctgggcac cgccgtgcct cgtcccgccg 18032 

ccgcgccggt cgccgtggcc acgcccgcgc gcccgagtcg cggagccaac tggcagagca 18092 

ctctgaacag catcgtgggc ctgggagtga aaagcctgaa acgccgccgg tgttactatt 18152 

aaagccagct aaatacccat gtgttgtatg cgcctcctgt gtcacgccag aaaaagccag 18212 

ccgagtgacg ggtcaccgcc gccgccaaga gcgccgcttt caag atg gcc acc ccc 18268 

Met Ala Thr Pro 
510 

tcg atg atg ccg cag tgg tct tac atg cac atc gcc ggg cag gac gcc 18316 
Ser Met Met Pro Gln Trp Ser Tyr Met His Ile Ala Gly Gln Asp Ala 
515 520 525 

tcg gag tac ctg agc ccg ggc ctg gtg cag ttc gcc cgc gcc acc gac 18364 
Ser Glu Tyr Leu Ser Pro Gly Leu Val Gln Phe Ala Arg Ala Thr Asp 
530 535 540 

acg tac ttc agc ctg ggc aac aag ttt agg aac ccc acg gtg gcc ccc 18412 
Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro Thr Val Ala Pro 
545 550 555 560 

acc cac gac gtg acg acg gac cgg tcc cag cgg ctg acg ctg cgg ttc 18460 
Thr His Asp Val Thr Thr Asp Arg Ser Gln Arg Leu Thr Leu Arg Phe 
565 570 575 

gtg ccc gtc gac cgc gag gac acc gcg tac tcg tac aaa gtg cgc ttc 18508 
Val Pro Val Asp Arg Glu Asp Thr Ala Tyr Ser Tyr Lys Val Arg Phe 
580 585 590 

acg ctg gcc gtg ggc gac aac cgc gtg ctg gac atg gcc agc acg tac 18556 
Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met Ala Ser Thr Tyr 
595 600 605 

ttt gac atc cgc ggc gtg ttg gac cgc ggt ccc agc ttc aaa ccc tac 18604 
Phe Asp Ile Arg Gly Val Leu Asp Arg Gly Pro Ser Phe Lys Pro Tyr 
610 615 620 

tcc ggc acc gcc tac aac tcc ctg gcc ccc aag ggc gcc ccc aac ccg 18652 
Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly Ala Pro Asn Pro 
625 630 635 640 

tca gaa tgg aag ggc tca gac aac aaa att agt gta aga ggt cag gct 18700 
Ser Glu Trp Lys Gly Ser Asp Asn Lys Ile Ser Val Arg Gly Gln Ala 
645 650 655 

ccg ttt ttt agt aca tcc att aca aag gat ggt att caa gtg gcc act 18748 
Pro Phe Phe Ser Thr Ser Ile Thr Lys Asp Gly Ile Gln Val Ala Thr 
660 665 670 

gat act tct agc gga gct gtg tat gct aaa aag gaa tat cag cct gaa 18796 
Asp Thr Ser Ser Gly Ala Val Tyr Ala Lys Lys Glu Tyr Gln Pro Glu 
675 680 685 

cca caa gta ggg caa gaa caa tgg aac agc gaa gcc agt gat agt gat 18844 
Pro Gln Val Gly Gln Glu Gln Trp Asn Ser Glu Ala Ser Asp Ser Asp 
690 695 700 

aaa gta gct ggt agg att cta aaa gac aca aca ccc atg ttc cct tgt 18892 
Lys Val Ala Gly Arg Ile Leu Lys Asp Thr Thr Pro Met Phe Pro Cys 
705 710 715 720 

tac ggt tcc tac gcc aag ccc aca aat gaa cag ggg ggg caa ggc act 18940 
Tyr Gly Ser Tyr Ala Lys Pro Thr Asn Glu Gln Gly Gly Gln Gly Thr 
725 730 735 

aat act gta gat ctg cag ttc ttt gcc tct tca tcg gct acc tct acg 18988 
Asn Thr Val Asp Leu Gln Phe Phe Ala Ser Ser Ser Ala Thr Ser Thr 
740 745 750 

cct aaa gcc gta ctc tat gcc gag gac gtg gca ata gaa gca cca gac 19036 
Pro Lys Ala Val Leu Tyr Ala Glu Asp Val Ala Ile Glu Ala Pro Asp 
755 760 765 

acc cat ttg gtg tac aaa ccg gca gtt aca acc acg acc act agt tcc 19084 
Thr His Leu Val Tyr Lys Pro Ala Val Thr Thr Thr Thr Thr Ser Ser 
770 775 780 

caa gac ctg cta act cag cag gct gct ccc aac cga ccc aac tac att 19132 
Gln Asp Leu Leu Thr Gln Gln Ala Ala Pro Asn Arg Pro Asn Tyr Ile 
785 790 795 800 

ggc ttc agg gat aat ttt atc ggt ctc atg tat tac aac tcc act ggc 19180 
Gly Phe Arg Asp Asn Phe Ile Gly Leu Met Tyr Tyr Asn Ser Thr Gly 
805 810 815 

aat atg ggt gtt ttg gca ggg caa gct tct cag cta aac gcg gtg gtt 19228 
Asn Met Gly Val Leu Ala Gly Gln Ala Ser Gln Leu Asn Ala Val Val 
820 825 830 

gac ttg caa gac aga aac acc gag ctg tcc tac cag ctc atg ctt gat 19276 
Asp Leu Gln Asp Arg Asn Thr Glu Leu Ser Tyr Gln Leu Met Leu Asp 
835 840 845 

gct ttg ggc gac cgc agt cgt tac ttc tcc atg tgg aac cag gcc gta 19324 
Ala Leu Gly Asp Arg Ser Arg Tyr Phe Ser Met Trp Asn Gln Ala Val 
850 855 860 

gac agc tat gac cct gat gtc aga att att gaa aat cat ggt gtg gag 19372 
Asp Ser Tyr Asp Pro Asp Val Arg Ile Ile Glu Asn His Gly Val Glu 
865 870 875 880 

gat gag ctg cca aac tac tgt ttc ccg cta gga ggg tcg cta gta act 19420 
Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Gly Gly Ser Leu Val Thr 
885 890 895 

gaa act tat aca ggc cta tca ccc caa aac gga agt aac acg tgg aca 19468 
Glu Thr Tyr Thr Gly Leu Ser Pro Gln Asn Gly Ser Asn Thr Trp Thr 
900 905 910 

acc gac agc acc acc tat gca act aga ggg gtg gaa atc ggc tct ggc 19516 
Thr Asp Ser Thr Thr Tyr Ala Thr Arg Gly Val Glu Ile Gly Ser Gly 
915 920 925 

aac atg ttc gcc atg gaa att aat ttg gcg gcc aat cta tgg agg agt 19564 
Asn Met Phe Ala Met Glu Ile Asn Leu Ala Ala Asn Leu Trp Arg Ser 
930 935 940 

ttc ctg tac tcc aac gtg gcc ctg tac ctg ccc gac gag tac aag ctc 19612 
Phe Leu Tyr Ser Asn Val Ala Leu Tyr Leu Pro Asp Glu Tyr Lys Leu 
945 950 955 960 

acc ccc gac aac atc acc ctc ccc gac aac aaa aac act tac gac tac 19660 
Thr Pro Asp Asn Ile Thr Leu Pro Asp Asn Lys Asn Thr Tyr Asp Tyr 
965 970 975 

atg aac ggc cgc gtg gcc gcc ccc agc tcc ctc gac acc tac gtc aac 19708 
Met Asn Gly Arg Val Ala Ala Pro Ser Ser Leu Asp Thr Tyr Val Asn 
980 985 990 

atc ggg gcg cgc tgg tcc ccc gac ccc atg gac aac gtc aac ccc ttc 19756 
Ile Gly Ala Arg Trp Ser Pro Asp Pro Met Asp Asn Val Asn Pro Phe 
995 1000 1005 
aac cac cac cgc aac gcg gga ctg cgc tac cgc tcc atg ctg ctg 19801 
Asn His His Arg Asn Ala Gly Leu Arg Tyr Arg Ser Met Leu Leu 
1010 1015 1020 



ggc aac ggc cgc tac gta ccc ttc cac atc caa gtg ccc cag aaa 19846 
Gly Asn Gly Arg Tyr Val Pro Phe His Ile Gln Val Pro Gln Lys 
1025 1030 1035 



ttc ttc gcc atc aaa aac ctc ctg ctc ctc ccc ggg tcc tac acc 19891 
Phe Phe Ala Ile Lys Asn Leu Leu Leu Leu Pro Gly Ser Tyr Thr 
1040 1045 1050 



tac gag tgg aac ttc cgc aag gac gtc aac atg atc ctc cag agc 19936 
Tyr Glu Trp Asn Phe Arg Lys Asp Val Asn Met Ile Leu Gln Ser 
1055 1060 1065 



agc ctg ggt aac gac ctc cgc gtc gac ggg gcc agc gtc agg ttc 19981 
Ser Leu Gly Asn Asp Leu Arg Val Asp Gly Ala Ser Val Arg Phe 
1070 1075 1080 



gac agc atc aac ctg tac gcc aac ttc ttc ccc atg gcc cac aac 20026 
Asp Ser Ile Asn Leu Tyr Ala Asn Phe Phe Pro Met Ala His Asn 
1085 1090 1095 



acc gcc tcc acg ctc gag gcc atg ctg cgc aac gac acc aac gac 20071 
Thr Ala Ser Thr Leu Glu Ala Met Leu Arg Asn Asp Thr Asn Asp 
1100 1105 1110 



cag tcg ttc aac gac tac ctc tgc gct gcc aac atg ctc tac ccc 20116 
Gln Ser Phe Asn Asp Tyr Leu Cys Ala Ala Asn Met Leu Tyr Pro 
1115 1120 1125 



atc ccc gcc aac gcc acc agc gtg ccc atc tcc att ccc tcg cgg 20161 
Ile Pro Ala Asn Ala Thr Ser Val Pro Ile Ser Ile Pro Ser Arg 
1130 1135 1140 

aac tgg gcc gcc ttc cgg ggc tgg agc ttc acc cgg ctc aag acc 20206 
Asn Trp Ala Ala Phe Arg Gly Trp Ser Phe Thr Arg Leu Lys Thr 
1145 1150 1155 

aag gag acc ccc tct ctg ggc tcc ggc ttc gat ccc tac ttc acc 20251 
Lys Glu Thr Pro Ser Leu Gly Ser Gly Phe Asp Pro Tyr Phe Thr 
1160 1165 1170 



tac tcg ggc tcc atc ccc tac ctg gac ggc acc ttc tac ctc aac 20296 
Tyr Ser Gly Ser Ile Pro Tyr Leu Asp Gly Thr Phe Tyr Leu Asn 
1175 1180 1185 



cac act ttc aag aag gtc tcc atc atg ttc gac tcc tcc gtc agc 20341 
His Thr Phe Lys Lys Val Ser Ile Met Phe Asp Ser Ser Val Ser 
1190 1195 1200 



tgg ccc ggc aac gac cgc ctg ctg acc ccc aac gag ttc gag atc 20386 
Trp Pro Gly Asn Asp Arg Leu Leu Thr Pro Asn Glu Phe Glu Ile 
1205 1210 1215 



aag cgc acc gtg gac ggg gaa ggg tac aac gtg gcc cag tgc aac 20431 
Lys Arg Thr Val Asp Gly Glu Gly Tyr Asn Val Ala Gln Cys Asn 
1220 1225 1230 



atg acc aag gac tgg ttc ctc atc cag atg ctc agc cac tac aac 20476 
Met Thr Lys Asp Trp Phe Leu Ile Gln Met Leu Ser His Tyr Asn 
1235 1240 1245 

atc ggc tac cag ggc ttc tac gtg ccQ gag ggc tac aag gac agg 20521 
Ile Gly Tyr Gln Gly Phe Tyr Val Pro Glu Gly Tyr Lys Asp Arg 
1250 1255 1260 

atg tac tct ttc ttc cgc aac ttc caa ccc atg agc cgc cag gtg 20566 
Met Tyr Ser Phe Phe Arg Asn Phe Gln Pro Met Ser Arg Gln Val 
1265 1270 1275 

S ? f fl^^ ^^"^ ^^"^ aaa aac gtc acc ctc ccc 20611 
^oL ^^"^ ^^'^ Asn Val Thr Leu Pro 
1280 1285 1290 

ttc cag cac aac aac tcg ggg ttc gtg gga tac atg ggc ccc acc 20656 
Phe Gln His Asn Asn Ser Gly Phe Val Gly Tyr Met Gly Pro Thr 
1295 1300 1305 

atg cgc gag ggg cag gcc tac ccc gcc aac tac ccc tac ccc ctg 20701 
Met Arg Glu Gly Gln Ala Tyr Pro Ala Asn Tyr Pro Tyr Pro Leu 
1315 1320 

atc ggc aag acc gcc gtg ccc agc ctc acg cag aaa aag ttc ctc 20746 
Ile Gly Lys Thr Ala Val Pro Ser Leu Thr Gln Lys Lys Phe Leu 
1325 1330 1335 

tgc gac cgc acc atg tgg cgc atc ccc ttc tcc agt aac ttc atg 20791 
Cys Asp Arg Thr Met Trp Arg Ile Pro Phe Ser Ser Asn Phe Met 
1340 1345 1350 

tcc atg ggg gcg ctc acc gac ctg ggg cag aac atg ctg tac gcc 20836 
Ser Met Gly Ala Leu Thr Asp Leu Gly Gln Asn Met Leu Tyr Ala 
1355 1360 1365 

aac tcc gcc cac gcc ctc gac atg acc ttc gag gtg gac ccc atg 20881 
Asn Ser Ala His Ala Leu Asp Met Thr Phe Glu Val Asp Pro Met 
1370 1375 1380 

gat gag ccc acg ctt ctc tat gtt ctg ttc gaa gtg ttc gac gtc 20926 
Asp Glu Pro Thr Leu Leu Tyr Val Leu Phe Glu Val Phe Asp Val 
1385 1390 1395 

gtg cgc atc cac cag ccg cac cgc ggc gtc atc gag gcc gtc tac 20971 
Val Arg Ile His Gln Pro His Arg Gly Val Ile Glu Ala Val Tyr 
1400 1405 1410 

ctg cgc acg ccg ttc tcg gcc ggt aac gcc acc acc taa ggagggggcc 21020 
Leu Arg Thr Pro Phe Ser Ala Gly Asn Ala Thr Thr 
1415 1420 1425 

gccgacggat gggctccagc gagccggagc tggtcgccat cgcgcgcgac ctgggctgcg 21080 
ggccctactt cctgggcacc tttgacaaac gcttcccggg cttcgtggcg ccgcacaagc 21140 

tggcctgcgc catcgtcaac accgccggac gcgagaccgg cggcgtccac tggctggccc 21200 

tggcctggaa cccccgcagc cgaacctgct acctcttcga ccccttcggc ttctcggacg 21260 

acaggctcag gcagatctac cagttcgagt acgaaggcct gctccggcgc agcgccctcg 21320 

cctccacccc cgaccactgc gtcaccctcg tcaagtccac ccagaccgtc caggggcccc 21380 

gctcggccgc ctgcggcctc ttctgctgca tgttcctgca cgccttcgtg cgctggcccg 21440 

cctcccccat ggacggcaac cccaccatgg acctccttac gggcgttccc aacagcatgc 21500 

ttcagagtcc ccaggtcgag cccaccctcc accgcaacca ggaggaactc tacgccttcc 21560 

tggctcggca ctccccctac tttcgccgcc accgcgagcg catagaaaag gccficcgcgt 21620 

ttgacaaaat gaacgactag attttctgtg aaaaacactc aataaagcct ttattggttc 21680 

accacacgtg cacgcatgca gactttttat ttaaaagggc tccgcctcct cgtcgccgtg 21740 

gctggtgggg agggagacgt tgcgatactg caggcgggag ctccatctga actpgggaat 21800 

cagcagcttg ggcagggggc cctcgacgtt ctcgctccac agcttgcgca ccagctgcag 21860 

ggcgcccagc aggtcgggcg cggagatctt gaagtcgcag ttggggccct ggttgccgcg 21920 

ggagttgcgg tacaccgggt tggcgcactg gaacaccagc acgctggggt gctcgatgct 21980 

ggccagcgcc gtcttgtcgg tcacctcgtc gcqgcgcagg gactccgcgt tgctcagcgc 22040 

gaaggcggtc agcttgcaca gctgccgacc cagcacgggc accccgctcg gctggttcag 22100 

gcagtcgcag cgcatagcca tcagcagccg cttctgcccg tgctgcatct tcggatagtc 22160 

ggctcgcatg aaggcctcca tctgccggaa ggccgtctgc gccttgctgc cctccgagaa 22220 

gaacagcccg caggacttgc cggagaacac gttgttgccg cagctcacgt cttccacgca 22280 

gcagcgcgcg tcgtcgttct tcagctgcac cacgctgcgg ccccagcggt tctgcaccac 22340 

cttggtcttg ccgggatgtk ccttcagggc ccgctggccg ttctcgctgg tcacgtccat 22400 

ctccaccacc tgctccttct ggatcatctc cagcccgtgg tagcagcgca gbacgccctc 22460 

ctgctcggtg cacccgtgca gccagacggc gcagccggtc ggctccagct gttgaggttt 22520 

caccccggcg taggtctcca cgtacgcccg caggaagcgg cccatcatct ccacaaaggt 22580 

cttctgaccg gtgaaggtca gctgcagccc gcgatgctcc tcgttgagcc acgtctgaca 22640 

gatcttgcgg tacaccttgc cctgctcggg cagaaacttg aaagcggcct tctccccggg 22700 

ctccacgtgg tacttctcca tca^cgccga catcagctcc atgcccttct ccca^gccga 22760 

caccagcggc tccgcgcggg ggttcaccac cgccatgcct cgggaagtgc cggggcgctc 22820 

atcttcctcc tcctcctcgt cttcttcttg aggcggcggt ggcggcagtt gtctcacgaa 22880 
tctcttgccg ttggccttck ggsjcgdtctc cacgccgggg tgggtgaacc cgtgggccac 22940 
caccacttcg tcctcttcct cttcgctgtc gggcacgact tcgggagagg gaggcggcgg 23000 
aggaaccggt gcggccactg cggccatcgc ggcgttcttg cgcgccttct tggggggcag 23060 
aggcggcgtc tcgcgctccg ggctggtctc ttgcaggtag ggcgtgatgg tgtgggaggt 23120 
ggggcgctct ggctgacggc cggccatgct gatgcttgac tcctaggcga aaagatggag 23180 
gaggatctta gacagccgca gcccgtctcc gaaaccttaa ccacccccgc ctctgaggtc 23240 
ggcgccggcg agctagacat gcaacgggag gaggaggagg acgtgcgagt ggagcaagac 23300 
ccgggctacg tgacgccgcc cgaggacggc gaggagccgc aggcaccggc gccaacgctc 23360 
agcgaagccg actacctggg aggggaggac gacgtgctgc tgaagcacct ggcgcggcag 23420 
agcaccatcg tgcaggaggc cctcaaggag cgcgaggagg tcccgctgac ggtggaggag 23480 
ctcagccggg PQtacgaagjc caacctcttc tcgccgcggg tgccccccaa gaagcaggdc 23540 
aacggcacct gcgagcccaa cccccgcctc aacttctacc ccgtctttgc ggtgcccgag 23600 
gcgctggcca cccatcacat cttcttcaag aaccagcgca tccccctctc gtgccgcgcc 23660 
aaccgcaccc gcgccgaccg cctcctgcat ctccgagccg gcgccgccat acctgagatc 23720 
gcctccctgg aggaagtccc caagatcttc gaaggtctcg gcaaggacga gaagcgcgcg 23780 
gcaaacgctc tggaaaagaa cgagagcgag ggtcagaacg tgctggtcga gctggaaggc 23840 
gacaacgcgc gtctggccgt gctcaaacgc accatcgaag tctcccactt cgcctacccc 23900 
gcgctcaacc ttccccccaa ggtcatgcgc tcggtcatgg atcagctgct catcaagcgc 23960 
gccgagcccc tcgagaacga ctccgaggtg gattccgagg acggaaaacc cgtggtctcg 24020 
gacgaggagc tcgcgcgctg gctgggcacg caggaccccg ccgagttgca agagcggcgc 24080 
aagatgatga tggcggccgt gctggtcacc gccgagctcg agtgcctgca gcgcttcttc 24140
gccgaccccc agaccctgcg caaggtdgag gagtccctgc actacgcctt ccgccacggc 24200
tacgtgcgcc aggcctgcaa gatctccaac gtggagctta gcaacctggt ctcctacatg 24260
ggcatcctgc acgagaaccg cctcgggcag aacgtcctcc actgcaccct gaccggggag 24320 
gcccgccgcg actacgtccg cgactgcatc tacctctttc tcaccctcac ctggcagacc 24380 
gccatggggg tctggcagca gtgtctggag gagcgcaacc ttcgcgagct cgacaagcta 24440 
ctgagccgcg agcgccgcga gctctggacg gctttcagcg agcgcaccgc cgcctgccgb 24500
ctggccgacc tcatcttccc cgagcgactc aggcaaaccc tccagaacgg cctgcccgac 24560
tttgtcagcc agagcatgct gcaaaacttt cgctccttca tcctggagcg atccggcatc 24620 
ttgcccgcca tgagctgcgc cctgccctcc gatttcgtcc ccctctatta tcgcgagtgc 24680 
cccccgccgc tctggagcca ctgctacctg ctgcgtctgg ccaactacct cgcccaccac 24740
tccgacctca tggaagactc cagcggcgag gggctgctgg agtgccactg ccgctgcaac 24800 
ctctgcacGC cccaccgctc gctggtctgc aacaccgagc tgctcagcga gacgcaagtg 24860 
atcggtacct ttgagatcca ggg'aptagag gggccggagg gtgcttccaa cctcaagctc 24920 
agcccggcgc tctggacttc cgcctacctg cgcaaattta tccccgagga ctatcacgcc 24980 
caccagatcc aattctacga agaccaatcg cgacccccca aagcccccct cacggcctgt 25040 
gtcatcaccc agagccagat tctggcccaa ttgcaagcca tccagcaggc ccgccaagag 25100 
ttcctcctga aaaagggtca cggggtctat ctggaccccc agaccggcga ggaactcaac 25160 
accccgtcac cctccgccgc cgcttcgtgc cgcccgcaga accatgccgc ccaaagggaa 25220 
caagcaggcc atcgcccagc ggcgggccaa gaagcagcaa gagctccagg agcagtggga 25280 
cgaggagtcc tgggacagcc aggcggagga agtctcagac gaggaggagg acatggagag 25340 
ctgggacagc ctagacgagg aggaggaggc cgaggagcta gaggacgagc ctctcgagga 25400 
ggaagagccc agcagcgccg cggcaccatc ggcttccaaa gaagcggctc ggagccggcc 25460 
ggccccgaag cagcagaagc agcaacagcc gccaccgtcg cccccgacgc caccaccagg 25520 
ctcactcaaa gccagccgta ggtgggacgc ggtgtccatc gcgggatcgc ccaaagcccc 25580 
agtcggtaag ccacccgggc ggtcggggcg ggggtactgt tcctggcgcc cccacaagag 25640 
caagatcgtc gcctgcctcc agcactgccg gggcaacatc tccttcgcgc ggcgctactt 25700 
gctcttccac gacggggtgg cggtgccgcg caacgtcctc tactattacc gtcatctcta 25760
cagcccctac gagacagaag gcccggcctc cgcgtaagac cagccgccag acggtctcct 25820 
ccgccatcgc gacccgccag gactcggccg ccacgcagga gctcagaaaa cgcatctttc 25880 
ccaccctgta tgctatcttc cagcagagcc gcggccagca gctggaactg aaagtaaaaa 25940
accgctccct gcgttcgctc acccgcagct gtctgtacca caggagggaa gaccaactgc 26000 
agcgcacgct cgaggacgcc gaggcactgt tcaataaata ctgctcggtg tctcttaagg 26060
actgaaagcG cgcgcttttt cagaggctca ctacgtcatc atcatcatga gcaaggacat 26120 
tcccacgcct tacatgtgga gctaccagcc gcagatggga ctggcggccg gcgcctccca 26180 
ggattactcc agtcgcatga actggctgag tgccggcccc cacatgatcg ggcgggtcaa 26240 
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tgggattcgt gccacccgca atcagatact gctggaacag gccgccctca cctccacccc 26300 

gcgacgtcag ctgaacccgc ccgcttggcc cgccgcccag gtgtaccagg aaaaccccgc 26360 

cccgaccaca gtcctcctgc cacgcgacgc ggaggccgaa gtccagatga ctaactccgg 26420 

ggcgcaatta gcgggcggcg cccgccacgt cgtcgctccc gggtacagag gtcggcccgc 26480 

accctacccc tccggccct:a taae^gaggct gatcattcga ggccgaggta tccagctcaa 26540 

cgacgaggtg gtgagctcct cgaccggtct tcggcccgac ggagtcttcc agcttggagg 26600 

cgccggccgc tcttccttca ccactcgcca ggcctacctg acgctccaga gctcttcctc 26660 

ccagcctcgc tccggcggca tcggcaccct ccagttcgtg gaggagttcg tgccctcggt 26720 

ctacttcaac ccgttctccg gctctcccgg ccgctacccg gacagcttca tccccaacta 26780 

cgacgcggtg agcgaatccg tggacggcta cgattgatga ccgatggtgc ggccgtaact 26840

gcgcggcggc aacatctgca tcactgccat cgtcctcggt gcttcgcccg ggaggcctgt 26900 

gagttcatct acttccagct cgccccggac cagcttcagg gcccttcgca cggcgttaag 26960 

ctcgtgatag aggaagagct cgagagtagc tgcctgcgct gttttacctc gcgccccatic 27020 

ctagtcgaga gggaacgcgg taggaccacc ctcaccctct actgcatctg tgactccccg 27080 

gaatnacatg aagatctgtg ttgccttcta tgtgccgaac aataacccct cttgtaacta 27140

cctacatcca caataaacca gaatttggaa actcctttcg tctgtttgca g atg aaa 27197

Met Lys 



cgc gcc cgc ctc gac gac gac ttc aac ccc gtc tac ccc tat gac 27242
Arg Ala Arg Leu Asp Asp Asp Phe Asn Pro Val Tyr Pro Tyr Asp
1430 1435 1440

act ccc aac gct ccc tct gtt ccc ttc atc act cct ccc ttc gtc 27287
Thr Pro Asn Ala Pro Ser Val Pro Phe Ile Thr Pro Pro Phe Val
1445 1450 1455

tcc tcg gac ggc ttg caa gaa aaa cca ccc gga atg ctc agt ctc 27332
Ser Ser Asp Gly Leu Gln Glu Lys Pro Pro Gly Met Leu Ser Leu
1460 1465 1470

aac tac caa gat cct att acc acc caa aac ggg gca tta act cta 27377
Asn Tyr Gln Asp Pro Ile Thr Thr Gln Asn Gly Ala Leu Thr Leu
1475 1480 1485

aag ctt ggc agc gga ctg aac ata aac caa gat ggg gaa ctt acc 27422
Lys Leu Gly Ser Gly Leu Asn Ile Asn Gln Asp Gly Glu Leu Thr
1490 1495 1500

tca gac gcc agc gtt ctc gtc act ccc ccc att aca aaa gcc aac 27467
Ser Asp Ala Ser Val Leu Val Thr Pro Pro Ile Thr Lys Ala Asn
1505 1510 1515



aac aca ata ggc cta gcc ttc aat gca cct ctt acc ttg caa agc 27512
Asn Thr Ile Gly Leu Ala Phe Asn Ala Pro Leu Thr Leu Gln Ser
1520 1525 1530

gat act tta aat ctt gct tgt aac gcc cca ctt acc gtg caa gac 27557
Asp Thr Leu Asn Leu Ala Cys Asn Ala Pro Leu Thr Val Gln Asp
1535 1540 1545

aat agg ttg gga ata aca tac aac tct ccc ctc acc ttg caa aac 27602
Asn Arg Leu Gly Ile Thr Tyr Asn Ser Pro Leu Thr Leu Gln Asn
1550 1555 1560

agc gaa ctt gcc cta gcg gtc acc ccg cct ctt gac act gcc aat 27647
Ser Glu Leu Ala Leu Ala Val Thr Pro Pro Leu Asp Thr Ala Asn
1565 1570 1575

aac aca ctt gcg ctt aaa acc gcc cgg cct ata att aca aac tct 27692
Asn Thr Leu Ala Leu Lys Thr Ala Arg Pro Ile Ile Thr Asn Ser
1580 1585 1590

aat aac gag ctt aca ctc tcc gct gat gct ccc cta aac acc agc 27737
Asn Asn Glu Leu Thr Leu Ser Ala Asp Ala Pro Leu Asn Thr Ser
1595 1600 1605

acg ggt acc ctc cgc cta caa agc gca gca cca ctg ggg cta gtt 27782
Thr Gly Thr Leu Arg Leu Gln Ser Ala Ala Pro Leu Gly Leu Val
1610 1615 1620

gac caa acc ctg cga gtg ctt ttt tct aac cca ctc tac ttg caa 27827
Asp Gln Thr Leu Arg Val Leu Phe Ser Asn Pro Leu Tyr Leu Gln
1625 1630 1635

aac aac ttt ctc tca cta gcc att gaa cgc cca ttg gct tta act 27872
Asn Asn Phe Leu Ser Leu Ala Ile Glu Arg Pro Leu Ala Leu Thr
1640 1645 1650

acc act ggt tct atg gct atg cag att tcc caa cca tta aaa gtg 27917
Thr Thr Gly Ser Met Ala Met Gln Ile Ser Gln Pro Leu Lys Val
1655 1660 1665

gaa gac gga agc tta agc ttg agc att gaa agc cct cta aat cta 27962
Glu Asp Gly Ser Leu Ser Leu Ser Ile Glu Ser Pro Leu Asn Leu
1670 1675 1680

aaa aac gga aat ctt act tta gga acc caa agt ccc cta act gtc 28007
Lys Asn Gly Asn Leu Thr Leu Gly Thr Gln Ser Pro Leu Thr Val
1685 1690 1695

act ggt aac aac ctc agc ctt aca aca aca gcc cca tta acg gtt 28052
Thr Gly Asn Asn Leu Ser Leu Thr Thr Thr Ala Pro Leu Thr Val
1700 1705 1710

cag aac aac gct cta gcc ctc tca gtg tta ctg ccg ctt aga cta 28097
Gln Asn Asn Ala Leu Ala Leu Ser Val Leu Leu Pro Leu Arg Leu
1715 1720 1725

ttt aab aac acc tca ctg gga gtg gca ttc aac cca ccc att tct 28142
Phe Asn Asn Thr Ser Leu Gly Val Ala Phe Asn Pro Pro Ile Ser
1730 1735 1740

tca gca aac aac ggg ctg tct ctt gac att gga aat ggc ctt aca 28187
Ser Ala Asn Asn Gly Leu Ser Leu Asp Ile Gly Asn Gly Leu Thr
1745 1750 1755



ctg caa tac aac agg ctc gta gtg aac att ggc ggc ggg cta cag 28232
Leu Gln Tyr Asn Arg Leu Val Val Asn Ile Gly Gly Gly Leu Gln
1760 1765 1770

ttt aac aac ggt gct att acc gct tcc ata aat gca gct ctg ccg 28277
Phe Asn Asn Gly Ala Ile Thr Ala Ser Ile Asn Ala Ala Leu Pro
1775 1780 1785

ttg cag tat tcc aat aac cag ctt tct ctt aat att gga ggc ggg 28322
Leu Gln Tyr Ser Asn Asn Gln Leu Ser Leu Asn Ile Gly Gly Gly
1790 1795 1800

ctg cga tac aac ggc act tac aaa aat tta gcc gtc aaa acc gac 28367
Leu Arg Tyr Asn Gly Thr Tyr Lys Asn Leu Ala Val Lys Thr Asp
1805 1810 1815

tct ttt agg ggt ctt gaa att gac agt aat cag ttc ctg gtg cca 28412
Ser Phe Arg Gly Leu Glu Ile Asp Ser Asn Gln Phe Leu Val Pro
1820 1825 1830

aga ctg ggt tct ggt cta aag ttt gat caa tat ggg tac att agc 28457
Arg Leu Gly Ser Gly Leu Lys Phe Asp Gln Tyr Gly Tyr Ile Ser
1835 1840 1845

gtc ata cct cca act gtt acg cca aca aca ctt tgg act aca gca 28502
Val Ile Pro Pro Thr Val Thr Pro Thr Thr Leu Trp Thr Thr Ala
1850 1855 1860

gac cct tct ccc aac gct act ttt tac gac agc tta gat gct aag 28547
Asp Pro Ser Pro Asn Ala Thr Phe Tyr Asp Ser Leu Asp Ala Lys
1865 1870 1875

gta tgg ctg gcc tta gta aaa tgc aac ggc atg gtt aat gga acc 28592
Val Trp Leu Ala Leu Val Lys Cys Asn Gly Met Val Asn Gly Thr
1880 1885 1890

ata gcc ata aag gct tta aaa ggt act ctg ctc caa cct acg gct 28637
Ile Ala Ile Lys Ala Leu Lys Gly Thr Leu Leu Gln Pro Thr Ala
1895 1900 1905

agt ttt att tct ttt gtt atg tat ttt tac agc aat ggc acc aga 28682
Ser Phe Ile Ser Phe Val Met Tyr Phe Tyr Ser Asn Gly Thr Arg
1910 1915 1920

aga act aac tac ccc acg ttt gaa aat gaa ggc ata cta gct agt 28727
Arg Thr Asn Tyr Pro Thr Phe Glu Asn Glu Gly Ile Leu Ala Ser
1925 1930 1935

agt gct aca tgg ggt tat cgt caa gga aac tcg gca aac acc aac 28772
Ser Ala Thr Trp Gly Tyr Arg Gln Gly Asn Ser Ala Asn Thr Asn
1940 1945 1950

gtc acc agt gcc gtt gaa ttt atg cct agc tcc aca aga tat cct 28817
Val Thr Ser Ala Val Glu Phe Met Pro Ser Ser Thr Arg Tyr Pro
1955 1960 1965

gtt aac aag ggt act gag gtt cag aac atg gaa ctc acc tac act 28862
Val Asn Lys Gly Thr Glu Val Gln Asn Met Glu Leu Thr Tyr Thr
1970 1975 1980

ttc ttg cag gga gac ccc act atg gcc ata tca ttt caa gct att 28907
Phe Leu Gln Gly Asp Pro Thr Met Ala Ile Ser Phe Gln Ala Ile
1985 1990 1995

tat aac cat gct ttg gaa ggt tac tct tta aaa ttt acc tgg cga 28952
Tyr Asn His Ala Leu Glu Gly Tyr Ser Leu Lys Phe Thr Trp Arg
2000 2005 2010

gtt cgc aac agg gaa cgc ttt gat atc ccc tgc tgt tct ttt tct 28997
Val Arg Asn Arg Glu Arg Phe Asp Ile Pro Cys Cys Ser Phe Ser
2015 2020 2025

tac ata acg gaa gaa taa acactgtttt tcttttcaat gtttttattc 29045
Tyr Ile Thr Glu Glu
2030



tgctttttta cacagttcga accgtcagac tccctccccc cttccacttc acccggtaca 29105

cctcccgctc cccctggatc gctgcgtaca actgcagttt ggtgttcaga cacgggttct 29165

taggtgacag tatccacacg gcctctttgc cggccaggcg ctggtccgta atgctcacaa 29225

atccctccga cacgtcctcc agacacacgg tggaatccaa ggcgcccgtc tadaaaacaa 29285

acacagtcat gctctccacg ggttctctcc tcggtcgtac tgcgccagcg tgaacgggcg 29345

atggtgctcc atcagggctc gcagcaaccg ctgtcggcgc ggctcaccca ggcbcbggcg 29405

aaaagcgccc cgtctgggag tgctattcaa aaaacgcacc gcctttatca acagtctcct 29465

cgtgcggcgg gcgcagcagc gcacctggat ctctgtcagg tctttacaat aggtacagcc 29525

catcaccacc atgttgttta aaatCGcaaa gctaaacacg ctccacccaa atgacatgaa 29585

ttccagcacc gccgcggcgt ggccattcata caatatgcgg aggtaaatca ggfgccgccc 29645

cctaatacaa acgctcccca tatacatcac ctccttaggc agttgataat taaccacctc 29705

ccggtaccag ggaaacctca cgtttactaa agccccaaac accaacattt taaaccagtt 29765

agccagcacc acccctcccg ccttkcactg cagcgacccc ggctgtttac aatgacagtg 29825

aatcacccac ctctcatacc ccctaatgac ctggcgtggc tccacatcta tagtagcaca 29885

gcacacgcac accctcatgt aatgcttcat cacaaatctt tcccaagggg ttagtatcat 29945

gccccagggt acgggccact cctgcagcac ggtgaaaggt acgcaggcgg gaacagtcct 30005

cacctcggac acataatgca tattcagatg ttcacactct aaaaccccgg ggcttccctc 30065 

caacgcagcc actggcaagt tctcagaggg tggtgtaagg cggtggtgct gatagggact 30125

caatctgtgt cgacaccgtc tgtcgcgttg catcgtagac caacgcttgg cgcaccgcct 30185

cgtacttcgc ccaaagaaaa cgggtgcgac gccaacacac ttccgcgtac cgtgggttcc 30245

gcactcgagc tcgctcagtt ctcaacgcat aatgcagcca ttcctgtaat cqacacaaca 30305 

gtcgctcggc ttccaaagag atgtgcacct cgtatcttat aacgtcccga tatatatcca 30365 

agcaggcagt cagggccact tgcaaccagt gcacgcaggc ggactgatcg cgacacactg 30425

gaggtggagg gagagacgga agaggcatgt tactccagac ggtcgaaaag cggatcaaag 30485 

tgcagatcgc gaagatggca gcgatccccg ccgctacgct ggtgatagat cacagccagg 30545 

tcaaacataa tgcggttttc caaatgacct attaccgcct ccaccagagc cgcpacgcgc 30605 

acttccagaa acaccagcac ggctacggca ttctcctcaa aatcttcaaa cattaagctg 30665 

catgattgaa tcacccccaa ataattctcc tccttccatt ctcgcaaaat ttgagtaaaa 30725 

acctctcgca gattagctcc gtggcgttca aaaaggtcac ttagagcgcc ctccaccgcc 30785 

atgcgcaagc acaccctcat gattgaaaaa tgccagtctc ctgaaccacc tgcagttgat 30845 

ttaaaagacc tatattagga tcaattccac tctcccgcag ctccacgcgt agcattagct 30905 

gcaaaaagtc atttaaatct tcgcaaacta gcgcggtaag ctcgccgccg ggaattaggt 30965 

ctgaagcagt caccacacac ataatttcca gtgaaggagt cagtctaagc agcaaaaagc 31025 

cgcatgagca gtgttgaaaa ggaggggtca cgcaatgtaa catatgcagc caaaaatctc 31085 

caaggtgtct gtgcataaac tccaccactg aaaagtccaa atcatgtaaa tatgccatca 31145

ccgcctcagg aaccaccacg gacacaaaaa cgggccgtag caaatacatg gtgtcctgca 31205

aagcaaaaac acatttatac catagaggcg cgaattactt ggggaaaaat cactcgctcc 31265 

aaaactaaac aggccaccgt ctgaccgcgc cagccataaa aaaagcggtt cgaatgatta 31325

aaaagaataa tagacacctc ccacc&ggta ctcggctgca actcgtgcgc ccctatcaaa 31385 

accccgcgga cgttcatgtc ggccatagaa aaaatgcggc ccaaatatcc caccggaatc 31445 

tccacggcca gctgcagtga tagcaaaaga acgccatgag gagcaatcac aaaattttca 31505 

ggcgataaaa gcacataaag gttagaatag ccctgctgca caggcaataa agcccgcgag 31565 

ctcagcaaat gcacataaac cgcttcagcc atcccgtctt accgcgaaca aaaggctcac 31625



agtacacagt tactcaaccc acacgccaca cagtatttat acactcctca atcgccacgt 31685

cacccgcccc gaacaaactc caaaagtcca aaaagtccaa aacgcccgcg taaaagcccg 31745 

ccaaaacagc acttcctcat ttactctccc acagtacgtc acttccgccg cgcccgccgc 31805 

cctcgccccg ccctcaccct cgcgctccac cccgcgcccc acgtcagact cccaccccgc 31865 

cccgcgcccg cgtcatccgc accccaccct cactccaccc ctaaccccgc ctcc,fecatta 31925

tcatattggc accgttccca aataaggtat attatgatga tg 31967 

<210> 13 

<211> 508 

<212> PRT 

<213> Simian adenovirus

<400> 13 

Met Arg Arg Ala Val Gly Val Pro Pro Val Met Ala Tyr Ala Glu Gly
1 5 10 15

Pro Pro Pro Ser Tyr Glu Thr Val Met Gly Ala Ala Asp Ser Pro Ala
20 25 30

Thr Leu Glu Ala Leu Tyr Val Pro Pro Arg Tyr Leu Gly Pro Thr Glu
35 40 45

Gly Arg Asn Ser Ile Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr
50 55 60

Thr Arg Val Tyr Leu Val Asp Asn Lys Ser Ala Asp Ile Ala Ser Leu
65 70 75 80

Asn Tyr Gln Asn Asp His Ser Asn Phe Leu Thr Thr Val Val Gln Asn
85 90 95

Asn Asp Phe Thr Pro Val Glu Ala Gly Thr Gln Thr Ile Asn Phe Asp
100 105 110

Glu Arg Ser Arg Trp Gly Gly Asp Leu Lys Thr Ile Leu Arg Thr Asn
115 120 125

Met Pro Asn Ile Asn Glu Phe Met Ser Thr Asn Lys Phe Arg Ala Arg
130 135 140

Leu Met Val Glu Lys Val Asn Lys Glu Thr Asn Ala Pro Arg Tyr Glu
145 150 155 160

Trp Phe Glu Phe Thr Leu Pro Glu Gly Asn Tyr Ser Glu Thr Met Thr
165 170 175

Ile Asp Leu Met Asn Asn Ala Ile Val Asp Asn Tyr Leu Glu Val Gly
180 185 190

Arg Gln Asn Gly Val Leu Glu Ser Asp Ile Gly Val Lys Phe Asp Thr
195 200 205

Arg Asn Phe Arg Leu Gly Trp Asp Pro Val Thr Lys Leu Val Met Pro
210 215 220

Gly Val Tyr Thr Asn Glu Ala Phe His Pro Asp Ile Val Leu Leu Pro
225 230 235 240

Gly Cys Gly Val Asp Phe Thr Gln Ser Arg Leu Ser Asn Leu Leu Gly
245 250 255

Ile Arg Lys Arg Met Pro Phe Gln Ala Gly Phe Gln Ile Met Tyr Glu
260 265 270

Asp Leu Glu Gly Gly Asn Ile Pro Ala Leu Leu Asp Val Ala Lys Tyr
275 280 285

Glu Ala Ser Ile Gln Lys Ala Arg Glu Gln Gly Gln Glu Ile Arg Gly
290 295 300

Asp Asn Phe Thr Val Ile Pro Arg Asp Val Glu Ile Val Pro Val Glu
305 310 315 320

Lys Asp Ser Lys Asp Arg Ser Tyr Asn Leu Leu Pro Gly Asp Gln Thr
325 330 335

Asn Thr Ala Tyr Arg Ser Trp Phe Leu Ala Tyr Asn Tyr Gly Asp Pro
340 345 350

Glu Lys Gly Val Arg Ser Trp Thr Leu Leu Thr Thr Thr Asp Val Thr
355 360 365

Cys Gly Ser Gln Gln Val Tyr Trp Ser Leu Pro Asp Met Met Gln Asp
370 375 380

Pro Val Thr Phe Arg Pro Ser Ser Gln Val Ser Asn Tyr Pro Val Val
385 390 395 400

Gly Val Glu Leu Leu Pro Val His Ala Lys Ser Phe Tyr Asn Glu Gln
405 410 415

Ala Val Tyr Ser Gln Leu Ile Arg Gln Ser Thr Ala Leu Thr His Val
420 425 430

Phe Asn Arg Phe Pro Glu Asn Gln Ile Leu Val Arg Pro Pro Ala Pro
435 440 445

Thr Ile Thr Thr Val Ser Glu Asn Val Pro Ala Leu Thr Asp His Gly
450 455 460

Thr Leu Pro Leu Arg Ser Ser Ile Ser Gly Val Gln Arg Val Thr Ile
465 470 475 480

Thr Asp Ala Arg Arg Arg Thr Cys Pro Tyr Val His Lys Ala Leu Gly
485 490 495

Ile Val Ala Pro Lys Val Leu Ser Ser Arg Thr Phe
500 505



<210> 14 

<211> 917 

<212> PRT 

<213> Simian adenovirus

<400> 14 

Met Ala Thr Pro Ser Met Met Pro Gln Trp Ser Tyr Met His Ile Ala
1 5 10 15

Gly Gln Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gln Phe Ala
20 25 30

Arg Ala Thr Asp Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro
35 40 45

Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gln Arg Leu
50 55 60

Thr Leu Arg Phe Val Pro Val Asp Arg Glu Asp Thr Ala Tyr Ser Tyr
65 70 75 80

Lys Val Arg Phe Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met
85 90 95

Ala Ser Thr Tyr Phe Asp Ile Arg Gly Val Leu Asp Arg Gly Pro Ser
100 105 110

Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly
115 120 125

Ala Pro Asn Pro Ser Glu Trp Lys Gly Ser Asp Asn Lys Ile Ser Val
130 135 140

Arg Gly Gln Ala Pro Phe Phe Ser Thr Ser Ile Thr Lys Asp Gly Ile
145 150 155 160

Gln Val Ala Thr Asp Thr Ser Gly Ala Val Tyr Ala Lys Lys Glu
165 170 175

Tyr Gln Pro Glu Pro Gln Val Gly Gln Glu Gln Trp Asn Ser Glu Ala
180 185 190

Ser Asp Ser Asp Lys Val Ala Gly Arg Ile Leu Lys Asp Thr Thr Pro
195 200 205

Met Phe Pro Cys Tyr Gly Ser Tyr Ala Lys Pro Thr Asn Glu Gln Gly
210 215 220

Gly Gln Gly Thr Asn Thr Val Asp Leu Gln Phe Phe Ala Ser Ser Ser
225 230 235 240

Ala Thr Ser Thr Pro Lys Ala Val Leu Tyr Ala Glu Asp Val Ala Ile
245 250 255

Glu Ala Pro Asp Thr His Leu Val Tyr Lys Pro Ala Val Thr Thr Thr
260 265 270

Thr Thr Ser Ser Gln Asp Leu Leu Thr Gln Gln Ala Ala Pro Asn Arg
275 280 285

Pro Asn Tyr Ile Gly Phe Arg Asp Asn Phe Ile Gly Leu Met Tyr Tyr
290 295 300

Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gln Ala Ser Gln Leu
305 310 315 320

Asn Ala Val Val Asp Leu Gln Asp Arg Asn Thr Glu Leu Ser Tyr Gln
325 330 335

Leu Met Leu Asp Ala Leu Gly Asp Arg Ser Arg Tyr Phe Ser Met Trp
340 345 350

Asn Gln Ala Val Asp Ser Tyr Asp Pro Asp Val Arg Ile Ile Glu Asn
355 360 365

His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Gly Gly
370 375 380

Ser Leu Val Thr Glu Thr Tyr Thr Gly Leu Ser Pro Gln Asn Gly Ser
385 390 395 400

Asn Thr Trp Thr Thr Asp Ser Thr Thr Tyr Ala Thr Arg Gly Val Glu
405 410 415

Ile Gly Ser Gly Asn Met Phe Ala Met Glu Ile Asn Leu Ala Ala Asn
420 425 430

Leu Trp Arg Ser Phe Leu Tyr Ser Asn Val Ala Leu Tyr Leu Pro Asp
435 440 445

Glu Tyr Lys Leu Thr Pro Asp Asn Ile Thr Leu Pro Asp Asn Lys Asn
450 455 460

Thr Tyr Asp Tyr Met Asn Gly Arg Val Ala Ala Pro Ser Ser Leu Asp
465 470 475 480

Thr Tyr Val Asn Ile Gly Ala Arg Trp Ser Pro Asp Pro Met Asp Asn
485 490 495

Val Asn Pro Phe Asn His His Arg Asn Ala Gly Leu Arg Tyr Arg Ser
500 505 510

Met Leu Leu Gly Asn Gly Arg Tyr Val Pro Phe His Ile Gln Val Pro
515 520 525

Gln Lys Phe Phe Ala Ile Lys Asn Leu Leu Leu Leu Pro Gly Ser Tyr
530 535 540

Thr Tyr Glu Trp Asn Phe Arg Lys Asp Val Asn Met Ile Leu Gln Ser
545 550 555 560

Ser Leu Gly Asn Asp Leu Arg Val Asp Gly Ala Ser Val Arg Phe Asp
565 570 575

Ser Ile Asn Leu Tyr Ala Phe Phe Pro Met Ala His Asn Thr Ala
580 585 590

Ser Thr Leu Glu Ala Met Leu Arg Asn Asp Thr Asn Asp Gln Ser Phe
595 600 605

Asn Asp ^r Leu Cys Ala Ala Asn Met Leu Tyr Pro Ile Pro Ala Asn
610 615 620

Ala Thr Ser Val Pro Ile Ser Ile Pro Ser Arg Asn Trp Ala Ala Phe
625 630 635 640

Arg Gly Trp Ser Phe Thr Arg Leu Lys Thr Lys Glu Thr Pro Ser Leu
645 650 655

Gly Ser Gly Phe Asp Pro Tyr Phe Thr Tyr Ser Gly Ser Ile Pro Tyr
660 665 670

Leu Asp Gly Thr Phe Tyr Leu Asn His Thr Phe Lys Lys Val Ser Ile
675 680 685

Met Phe Asp Ser Ser Val Ser Trp Pro Gly Asn Asp Arg Leu Leu Thr
690 695 700

Pro Asn Glu Phe Glu Ile Lys Arg Thr Val Asp Gly Glu Gly Tyr Asn
705 710 715 720

Val Ala Gln Cys Asn Met Thr Lys Asp Trp Phe Leu Ile Gln Met Leu
725 730 735

Ser His Tyr Asn Ile Gly Tyr Gln Gly Phe Tyr Val Pro Glu Gly Tyr
740 745 750

Lys Asp Arg Met Tyr Ser Phe Phe Arg Asn Phe Gln Pro Met Ser Arg
755 760 765

Gln Val Val Asp Thr Thr Thr Tyr Thr Asp Tyr Lys Asn Val Thr Leu
770 775 780

Pro Phe Gln His Asn ^„ Ser Gly Phe Val Gly Tyr Met Gly Pro Thr
785 790 795 800

Met Arg Glu Gly Gln Ala Tyr Pro Ala Asn Tyr Pro Tyr Pro Leu Ile
805 810 815

Gly Lys Thr Ala Val Pro Ser Leu Thr Gln Lys Lys Phe Leu Cys Asp
820 825 830

Arg Thr Met Trp Arg Ile Pro Phe Ser Ser Asn Phe Met Ser Met Gly
835 840 845

Ala Leu Thr Asp Leu Gly Gln Asn Met Leu Tyr Ala Asn Ser Ala His
850 855 860

Ala Leu Asp Met Thr Phe Glu Val Asp Pro Met Asp Glu Pro Thr Leu
865 870 875 880

Leu Tyr Val Leu Phe Glu Val Phe Asp Val Val Arg Ile His Gln Pro
885 890 895

His Arg Gly Val Ile Glu Ala Val Tyr Leu Arg Thr Pro Phe Ser Ala
900 905 910

Gly Asn Ala Thr Thr
915

<210> 15

<211> 607 
<212> PRT 

<213> Simian adenovirus 
<400> 15 

Met Lys Arg Ala Arg Leu Asp Asp Asp Phe Asn Pro Val Tyr Pro Tyr
1 5 10 15

Asp Thr Pro Asn Ala Pro Ser Val Pro Phe Ile Thr Pro Pro Phe Val
20 25 30

Ser Ser Asp Gly Leu Gln Glu Lys Pro Pro Gly Met Leu Ser Leu Asn
35 40 45

Tyr Gln Asp Pro Ile Thr Thr Gln Asn Gly Ala Leu Thr Leu Lys Leu
50 55 60

Gly Ser Gly Leu Asn Ile Asn Gln Asp Gly Glu Leu Thr Ser Asp Ala
65 70 75 80

Ser Val Leu Val Thr Pro Pro Ile Thr Lys Ala Asn Asn Thr Ile Gly
85 90 95

Leu Ala Phe Asn Ala Pro Leu Thr Leu Gln Ser Asp Thr Leu Asn Leu
100 105 110

Ala Cys Asn Ala Pro Leu Thr Val Gln Asp Asn Arg Leu Gly Ile Thr
115 120 125

Tyr Asn Ser Pro Leu Thr Leu Gln Asn Ser Glu Leu Ala Leu Ala Val
130 135 140

Thr Pro Pro Leu Asp Thr Ala Asn Asn Thr Leu Ala Leu Lys Thr Ala
145 150 155 160

Arg Pro Ile Ile Thr Asn Ser Asn Asn Glu Leu Thr Leu Ser Ala Asp
165 170 175

Ala Pro Leu Asn Thr Ser Thr Gly Thr Leu Arg Leu Gln Ser Ala Ala
180 185 190

Pro Leu Gly Leu Val Asp Gln Thr Leu Arg Val Leu Phe Ser Asn Pro
195 200 205

Leu Tyr Leu Gln Asn Asn Phe Leu Ser Leu Ala Ile Glu Arg Pro Leu
210 215 220

Ala Leu Thr Thr Thr Gly Ser Met Ala Met Gln Ile Ser Gln Pro Leu
225 230 235 240

Lys Val Glu Asp Gly Ser Leu Ser Leu Ser Ile Glu Ser Pro Leu Asn
245 250 255

Leu Lys Asn Gly Asn Leu Thr Leu Gly Thr Gln Ser Pro Leu Thr Val
260 265 270

Thr Gly Asn Asn Leu Ser Leu Thr Thr Thr Ala Pro Leu Thr Val Gln
275 280 285

Asn Asn Ala Leu Ala Leu Ser Val Leu Leu Pro Leu Arg Leu Phe Asn
290 295 300

Asn Thr Ser Leu Gly Val Ala Phe Asn Pro Pro Ile Ser Ser Ala Asn
305 310 315 320

Asn Gly Leu Ser Leu Asp Ile Gly Asn Gly Leu Thr Leu Gln Tyr Asn
325 330 335

Arg Leu Val Val Asn Ile Gly Gly Gly Leu Gln Phe Asn Asn Gly Ala
340 345 350

Ile Thr Ala Ser Ile Asn Ala Ala Leu Pro Leu Gln Tyr Ser Asn Asn
355 360 365

Gln Leu Ser Leu Asn Ile Gly Gly Gly Leu Arg Tyr Asn Gly Thr Tyr
370 375 380

Lys Asn Leu Ala Val Lys Thr Asp Ser Phe Arg Gly Leu Glu Ile Asp
385 390 395 400

Ser Asn Gln Phe Leu Val Pro Arg Leu Gly Ser Gly Leu Lys Phe Asp
405 410 415

Gln Tyr Gly Tyr Ile Ser Val Ile Pro Pro Thr Val Thr Pro Thr Thr
420 425 430

Leu Trp Thr Thr Ala Asp Pro Ser Pro Asn Ala Thr Phe Tyr Asp Ser
435 440 445

Leu Asp Ala Lys Val Trp Leu Ala Leu Val Lys Cys Asn Gly Met Val
450 455 460

Asn Gly Thr Ile Ala Ile Lys Ala Leu Lys Gly Thr Leu Leu Gln Pro
465 470 475 480

Thr Ala Ser Phe Ile Ser Phe Val Met Tyr Phe Tyr Ser Asn Gly Thr
485 490 495

Arg Arg Thr Asn Tyr Pro Thr Phe Glu Asn Glu Gly Ile Leu Ala Ser
500 505 510

Ser Ala Thr Trp Gly Tyr Arg Gln Gly Asn Ser Ala Asn Thr Asn Val
515 520 525

Thr Ser Ala Val Glu Phe Met Pro Ser Ser Thr Arg Tyr Pro Val Asn
530 535 540

Lys Gly Thr Glu Val Gln Asn Met Glu Leu Thr Tyr Thr Phe Leu Gln
545 550 555 560

Gly Asp Pro Thr Met Ala Ile Ser Phe Gln Ala Ile Tyr Asn His Ala
565 570 575

Leu Glu Gly Tyr Ser Leu Lys Phe Thr Trp Arg Val Arg Asn Arg Glu
580 585 590

Arg Phe Asp Ile Pro Cys Cys Ser Phe Ser Tyr Ile Thr Glu Glu
595 600 605
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